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STRUCTURES  AND  ALGORITHMS  IN  STOCHASTIC  REALIZATION 
THEORY  AND  THE  SMOOTHING  PROBLEM 

By 

Faris  Badawi 


This  thesis  contains  two  main  topics,  each  of  which  is  connec¬ 
ted  to  the  stochastic  realization  problem.  First,  we  consider  some 
structural  and  algorithmic  problems  in  wide  sense  stochastic  realiza¬ 
tion  theory  which  also  have  applicability  to  many  problems  outside 
the  realm  of  stochastic  realization  theory  but  are  here  formulated  in 
that  framework.  We  consider  some  geometric  questions  concerning  the 
solution  set  of  the  positive  real  lemma  and  provide  a  Hamiltonian 
framework  for  the  non-Riccati  algorithms  of  Kailath  and  Lindquist; 
these  are  then  applied  to  the  stochastic  realization  problem.  Secondly, 
we  apply  the  basic  techniques  and  concepts  of  the  strict  sense  (proper) 
stochastic  realization  theory  of  Lindquist  and  Picci  and  Ruckebusch  to 
the  discrete-time  smoothing  problem.  This  provides  a  natural  inter¬ 
pretation  of  the  Mayne-Fraser  two-point  formula'  as  well  as  many  other 
smoothing  results,  the  interpretations  of  which  have  hitherto  been 
quite  unclear  from  a  probabilistic  point  of  view.  Hence  we  have  laid 
the  ground  work  for  a  theory  of  smoothing  which  has  so  far  been 
lacking. 
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INTRODUCTION 


The  stochastic  realization  problem  can  be  simply  stated  as 
follows:  Given  an  m-dimensional  stochastic  process  (y(t);  t  e  I), 
where  the  index  set  I  may  be  either  an  interval  of  the  real  line  or  a 
set  of  integers,  find  all  linear  stochastic  systems  (in  some  suitable 
class)  having  the  process  y  as  its  output  process.  These  stochastic 
systems  are  called  stochastic  realizations  of  y.  This  problem  is  of 
considerable  importance  in  stochastic  systems  theory  and  has  applica¬ 
tions  in  and  connections  to  many  fields  of  study,  among  which  are  net¬ 
work  theory  [8],  spectral  factorization  [3,8],  optimal  control  theory 
[6,11],  stability  theory  [4]  and  the  smoothing  problem  [64,65]. 

The  early  contributions  to  this  problem  are  due  to  B.D.O. 

Anderson  [3]  and  Faurre  [11],  the  first  of  whom  called  it  the  "inverse 
problem  of  covariance  generation."  In  these  early  papers,  the  sto¬ 
chastic  realization  problem  was  studied  from  a  deterministic  point  of 
view,  the  objective  being  to  determine  the  parameters  of  the  stochastic 
systems  rather  than  to  clarify  their  probabilistic  structures.  These 
early  results  have  been  extended  by  Clerget  [71]  and  Germain  [18]. 
Following  [2],  we  shall  term  these  aspects  of  the  stochastic  realiza¬ 
tion  problem  wide  sense.  These  problems  are  intimately  connected  to 
spectral  factorization  [12]  and  the  positive  real  lerrtm  [75-77]  (and  its 
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nonstationary  extensions),  the  set  of  all  state  covariance  matrices  P 
of  the  stochastic  realizations  of  a  stationary  process  being  the  solu¬ 
tion  set  of  the  positive  real  lemma.  The  set  P  is  also  the  solution 
set  of  the  Quadratic  Matrix  Inequality  [12,16],  and  a  certain  subset 
?o  of  P  contains  the  solutions  of  the  corresponding  Algebraic  Riccati 
Equation  [16]. 

More  recently  the  probabilistic  aspects  of  stochastic  realiza¬ 
tions  and  their  relation  to  Markovian  representations  have  been  studied 
in  various  aspects  and  degrees  of  completeness  by  Akaike[78,79] ,  Picci 
[80],  Lindquist  and  Picci  [2,53-56],  Ruckebusch  [1,10,58-60],  Lind¬ 
quist,  Picci  and  Ruckebusch  [57],  Pavon  [9,62]  and  Willems  and  van 
Schuppen[81] .  Here  one  is  interested  in  a  complete  probabilistic  de¬ 
scription  of  the  stochastic  realizations;  such  a  realization  will  be 
named  proper  [2]. 

In  this  thesis,  we  study  certain  aspects  connected  with  the  sto¬ 
chastic  realization  problem:  We  consider  some  structural  and  algorithmic 
problems  in  wide  sense  stochastic  realization  theory  and  the  applica¬ 
tions  of  proper  stochastic  realization  theory  to  the  smoothing  problem. 
However,  some  of  these  results,  in  particular  those  related  to  non- 
Riccati  algorithms  and  the  structure  of  the  set  P,  are  not  only  part  of 
stochastic  realization  theory,  but  have  wider  applicability. 

In  Chapter  1,  we  consider  wide  sense  stochastic  realization  theory 
for  stationary  processes  with  rational  spectral  densities  in  both  dis¬ 
crete-  and  continuous-time.  The  structures  of  the  sets  P  and  P„ 

o 

mentioned  above  are  studied.  A  parametric  representation  for  P  is  given 


and  its  boundary  points  are  characterized.  These  results  are  generali¬ 
zations  of  some  found  in  [2,11,18].  Then  we  show  that  the  elements  of 
PQ  are  extreme  points  of  P.  This  seems  to  be  a  well-known  result;  how¬ 
ever,  we  have  been  unable  to  find  a  proof  of  this  anywhere  in  the 
literature.  In  Sections  1.4  and  l.S,  we  apply  the  theory  of  Hamiltonian 
systems  to  obtain  a  new  derivation  of  the  non-Riccati  algorithms  of 
Kailath  and  his  coworkers  [21,29]  and  Lindquist  [22,23,27,83].  The 
basic  idea  of  this  proof  was  suggested  to  us  by  L.E.  Zachrisson.  The 
continuous-time  version  of  this  result,  presented  in  Section  1.4,  is 
quite  straight-forward  and  our  derivation  follows  [82]  closely.  As 
expected  the  discrete-time  version  is  considerably  more  complicated;  it 
is  presented  in  Section  l.S.  We  obtain  these  results  for  the  special 
type  of  Riccati  equations  that  arises  in  the  context  of  stochastic 
realizations;  our  results  on  the  general  case  will  be  presented  else¬ 
where.  Finally,  the  factorization  of  the  discrete-time  Riccati  equation 
presented  in  Section  l.S  is  applied  to  generate  realizations  of  y. 

These  results  are  the  continuous-time  counterpart  of  Section  6  in  [2] . 

While  the  study  of  the  proper  stochastic  realization  problem  is 
of  interest  in  itself,  its  concepts  and  techniques  can  be  applied  to 
other  problems,  an  example  of  which  is  smoothing.  In  Chapter  2,  a 
smoothing  theory  for  discrete-time  r.onstationary  systems  is  developed, 
much  in  analogy  with  our  continuous -time  papers  [64,65].  It  is  shown 
that  the  smoothing  estimate  is  contained  in  a  finite-dimensional  space 
H®,  the  frame  space  [53-56].  Unlike  the  situation  in  continuous-time, 
in  the  discrete  setting,  is  not  of  constant  dimension,  contributing 
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to  the  fact  that  discrete-time  problems  are  not  just  trivial  modifica¬ 
tions  of  their  continuous -time  counterparts .  In  the  continuous-time 
setting  [64,65],  the  invertibility  of  certain  covariance  matrices  is 
essential.  In  discrete-time,  this  invertibility  does  not  hold  on  the 
whole  interval.  Hence  we  apply  the  generalized  Moore-Penrose  pseudo¬ 
inverse  [13,66] ,  thereby  introducing  some  further  structure  in  the 
theory.  In  Section  2.6,  we  derive  a  two-filter  formula  of  the  Mayne- 
Praser  type  for  the  smoothing  estimate.  This  formula  is  in  terms  of 
two  estimates:  the  first  (x*)  is  generated  by  the  usual  forward  Kalman- 
Bucy  filter  and  the  second  (x*)  by  the  forward  counterpart  of  the 
backward  Kalman-Bucy  filter.  This  provides  further  insight  into  the 
classical  theory  of  smoothing.  In  the  final  Section  2.7  we  use  a 
different  technique  to  derive  the  smoothing  formulas  of  Bryson  and 
Frazier  [35]  and  Rauch,  1\mg  and  Striebel  [36],  which  does  not  employ 
the  frame  space,  but  an  orthogonal  decomposition  of  the  closed  linear 
span  of  (y(t);  t  c  1}  much  along  the  same  lines  as  the  procedure  used 
in  [9]  to  solve  the  stationary  stochastic  realization  problem.  Unfor¬ 
tunately,  due  to  time  constraints,  we  have  not  had  time  to  tie  up  all 
the  loose  ends  of  this  theory,  and  as  explained  in  the  text,  some  pro¬ 
blems  have  been  left  open,  which  we  feel  otherwise  would  have  been 
resolved  with  a  moderate  amount  of  extra  work  since  all  the  ingredients 
needed  are  at  hand.  We  shall  have  to  return  to  this  in  a  subsequent 
paper. 

Chapter  3  is  devoted  to  a  study  of  some  topics  on  the  stochastic 
realization  problem  for  continuous -time  nonstationary  processes  defined 
on  the  real  line.  One  aim  is  to  generalize  the  finite  interval  theory 
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presented  in  [64,65].  As  a  by-product  we  obtain  a  natural  stochastic 
interpretation  of  an  algorithm  due  to  Clerget  [71]  for  the  minimum  and 
maximum  variance  realizations.  Then,  the  non-Riccati  algorithm  of  [2] 
(and  Section  1.6)  is  generalized  to  this  nonstationary  setting.  All 
the  results  of  the  thesis  are  for  the  so-called  regular  case  (i.e. , 
there  is  a  full -rank  observation  noise),  but  many  of  them  can  be 
generalized  to  the  nonregular  setting.  To  aid  the  reader  in  doing  the 
necessary  conversions  for  this,  we  have  included  a  section  (Section  3.6) 
providing  the  necessary  transformations.  Our  results  here  are  general¬ 
izations  of  the  stationary  counterparts  in  [18],  in  which  paper,  a  con¬ 
trol  theory  approach  is  taken.  Our  results  should  be  compared  with 
those  of  Anderson  et  at  [74]. 

Sections  1.4  and  l.S,  and  Chapter  2  are  based  on  joint  work  with 
Professor  Anders  Lindquist. 

We  shall  adopt  the  following  notations  in  the  sequel.  The  trans¬ 
pose  of  a  matrix  is  denoted  by  (’).  E{*}  is  the  mathematical  expecta¬ 

tion,  I  is  the  unit  matrix.  All  vectors  without  prime  are  column 
vectors.  If  R  is  a  symmetric  matrix,  R  >  0  (R  £  0)  means  that  R  is 
positive  (nonnegative)  definite.  If  R  a  0,  Ra  is  the  unique  nonnega¬ 
tive  square  root  of  R.  6  is  the  Kronecker  symbol.  The  set  of  inte¬ 
gers  will  be  denoted  by  Z;  Z+  will  denote  {0,  1,  2,  ...}.  Finally, 


the  set  of  real  numbers  will  be  denoted  by  IR. 


CHAPTER  1 


RICCATI  AND  NON-RICCATI  METHODS  IN  STOCHASTIC  ■ 

REALIZATION  THEORY  OF  STATIONARY  PROCESSES 

1.1.  Stochastic  Realization  Theory:  A  Review 

In  this  section,  we  shall  review  certain  facts  from  stochastic 
realization  theory  for  both  discrete  -  and  continuous  -  time  stationary 
processes.  The.  discrete-time  case  will  be  presented  first. 

Let  {y(t);  t  e  I},  where  Z  is  the  set  of  integers,  be  an 
m-dimensional  centered  stationary  and  purely  nondeterministic  stochastic 
process  defined  on  an  underlying  probability  space.  The  process  y  is 
said  to  have  an  n-dimensional  Markovian  representation  [1]  if  there 
exists  an  n-dimensional  stochastic  process  (x(t);  t  e  Z},  which 
together  with  y  satisfy  a  linear  stochastic  system 

x(t+l)  *  Ax(t)  +  Bw(t)  (1.1a) 

y(t)  =  Cx(t)  +  Dw(t)  (1.1b) 

where  A,  B,  C  and  D  are  constant  matrices  of  dimensions  n  x  n,  n  x  p, 
m  x  n  and  m  x  p  respectively,  A  is  a  stability  matrix  i.e.  all  its 
eigenvalues  are  strictly  inside  the  unit  circle  (for  short,  | X (A) |  <  1), 
and  (w(t);  t  e  Z}  is  a  p-dimensional  white  noise,  i.e.  a  zero-mean 
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stochastic  process  of  covariance  E{w(t)w(s)'}  ■  The  process  x  is 

called  the  state  of  the  system;  it  is  stationary  with  constant  covariance 

E{x(t)  x(t)‘>  *  P  »  P'  i  0  (1.2) 

which  clearly  satisfies  the  Liapunov-type  equation 

P  »  APA»  ♦  BB'.  (1.3) 

The  process  y  is  called  the  output  of  the  system  (1.1)  and  w  is  the 
input. 

The  covariance  function  K^(s):  *  E{y(t+s)y(s)  *}  of  the  output  pro¬ 
cess  y  of  (1.1)  is  easily  seen  to  be 

K  (s)  *  CAS-1G  1  +  G'A’ ~S”1  C»1  +  (CPC'  +  DD'}6  ,  (1.4) 

/  S  “S  SO 

where  G  *  APC*  +  BD'  and  1  *  1  if  s  >  0  and  0  otherwise.  Then  the 

s 

spectral  density  function  $(z)  of  y  is  given  by 
00 

*(2)  »  l  K  (s)z"S  *  C(zI-A)-1G  +  G,(z'1J-A‘)"1C'  +  (CPC’  +  DD'),  (1.5) 
s*-»  ' 

which  is  a  rational  function  in  z.  It  is  easy  to  see  that  <&(z)  has  the 
following  properties 

(i)  each  element  of  $  is  analytic  on  the  unit  disc:  \z\  si, 

(ii)  4>(z)  =  $(z"*)'  and 

(iii)  $(e*W)  s  0  for  all  w  €  ]R. 

The  stochastic  realization  problem  is  the  inverse  of  the  above  con¬ 
struction  i.e.  from  the  knowledge  of  the  spectral  density  of  a  process, 
we  wish  to  determine  all '  representations  (1.1)  of  the  process. 
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More  specifically,  let  (y(t);  t  e  Z}  be  as  above.  Let  the  spectral 
density  $(z)  of  y  be  given.  Assume  $(z)  has  the  above  properties  (i.e. 
it  is  rational  and  satisfies  (i)-(iii)  above.)  In  addition,  assume  that 
$(e*w)  >  0  for  all  ui  e  H  and  that  0  <  $(“)'  <®  (the  significance  of 
these  assumptions  will  be  made  clear  when  the  need  arises.)  The  problem 
is  to  find  all  Markovian  representations  (1.1)  with  n  *  dim  A  minimal 
and  whose  outputs  have  the  same  spectral  density  $  as  that  of  the  given 
process  y.  Such  a  representation  will  be  called  a  wide  sense  stochastic 
realisation  [2]  of  y,  although  it  might  be  more  descriptive  to  call  it  a 
realization  of  $.  In  fact,  this  is  a  deterministic  problem  which  re¬ 
quires  determining  all  quadruplets  [A,  B,  C,  D]  from  the  knowledge  of  $. 
The  probabalistic  problem  of  finding  all  proper  [2]  stochastic  realiza¬ 
tions  (i.e.  all  systems  (1.1)  whose  outputs  not  merely  have  the  same 
covariance  properties  as  the  given  process,  but  are  equal  to  it  a.s.) 
will  be  discussed  in  Chapter  2  in  the  nonstationary  setting. 

The  wide  sense  stochastic  realization  problem  is  equivalent  to  the 
classical  spectral  factorization  problem  [3]:  given  $(z),  find  all 
minimal  stable  spectral  factors  of  $  i.e.  all  matrices  W(z)  of  proper 
real  rational  functions  of  minimal  McMillan  degree  [4]  with  all  poles 
inside  the  unit  circle  and  satisfying 

$(z)  =  W(z)  W(z-1) '  .  (1.6) 

To  see  that  this  is  the  case,  first  observe  that  if  [A,  B,  C,  D]  is  a 
minimal  realization  of  y,  then 

W(z)  =  C(zl-A) -1B  +  D 


(1.7) 
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is  a  minimal  stable  spectral  factor  of  4.  Conversely,  any  such  minimal 
spectral  factor  W,  gives  rise  to  a  whole  class  of  wide  sense  stochastic 
realizations  of  the  form 

[T_1AT,  T_1B,  CT,D]  (1.8) 

where  [A,  B,  C,  0]  is  a  minimal  realization  [4]  of  W  and  T  is  an  arbi¬ 
trary  nonsingular  n  x  n  matrix. 

Using  the  method  of  partial  fractions,  4(z)  can  be  written 

4(z)  =  S(z)  +  S(z‘V  (1.9) 

where  S  is  a  discrete  positive  real  rational  function  [5].  Since  S  is 

» 

proper(i.e.  S(°»)  <  «) ,  it  has  a  minimal  realization  [F,  G,  H,  J] ,  i.e. 

S(z)  *  H(zI-F)**1G  +  J  (1.10) 

for  some  constant  matrices  F,  G,  H  and  J  of  dimensions  n  x,  n,  n  x  m, 
m  x  n  and  m  x  rc  respectively,  where  n  is  the  McMillan  degree  [4]  of  S. 
Hence,  |X(F)|<  1, (F,  G)  is  controllable  and  (H,  F)  is  observable. 
Several  procedures  are  available  for  determining  [F,  G,  H,  J]  [6,7], 
which  is  unique  up  to  the  equivalence  (1.8).  Using  the  fact  that  S  is 
discrete  positive  real  and  the  Positive  Real  Lemma,  Anderson  [5,8]  has 
shown  that  all  wide  sense  stochastic  realizations  of  y  are  given  by 

[A,  B,  C,  D]  =  [T_1FT,  T“1(B1,  B2)  V,  HT,  (R(P)%,  0)V]  (1.11) 

where  T  is  as  above,  V  is  a  p  x  p  constant  orthogonal  matrix,  is 

n  x  m  and  B2  is  n  x  (p-m),  P  is  n  x  n  symmetric  positive  definite  matrix 
which  together  with  B^,  B2  and  R(P)  satisfy 


wnp.i*r 


n 
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P  »  FPF»  +  B1BJ  +  B2B^  (1.12a) 

G  *  FPH*  ♦  BjR(P)**  (1.12b) 

R(P)  *  J  +  J'  -  HPH'  .  (1.12c) 

Since  0  <  $(«)  <  •,  and  assuming  dim  F  *  n  i  1,  it  is  easy  to  show  that 
F  is  nonsingular  [9],  R(P)  >  0  [9,10]  and  that 

R(P)  »  G'F'^H'  +  $(«)  -  HPH*  .  Cl- 13) 

It  is  no  restriction  to  take  T  ■  V  ■  I  i.e.  to  consider  realizations  of 
the  form 

x(t+l)  »  F  x(t)  +  B1u(t)  +  B2v(t)  Cl-  14a) 

y(t)  -  H  x(t>  ♦  R(P)Js  u(t)  ,  (1.14b) 

where  w  =  [“] . 

Let  P  *  {P  |  P  solves  (1.12)}.  For  each  P  c  P,  define 

A(P)=  -P  +  FPF*  +  (G-FPH^RCP^^G-FPH')'  ,  (1.15) 

and  let  PQ  *  {P  e  P  |  A(P)  *  0} 

In  the  following  preposition,  we  collect  some  facts  from  Anderson  [5], 
Faurre  [11]  and  Pavon  [9]. 

Proposition  1.1.  The  set  P  is  convex  and  compact  and  there  are  two 
elements  P*  and  P*  in  P^such  that  P*  s  P  s  P *for  all  P  e  P.  Moreover , 

P  =  {P | A(P)  s  0}.  Finally,  PQ  is  the  set  of  all  solutions  of  (1.12)  for 
which  B  =  0. 
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Remark.  In  the  proper  stochastic  realization  setting  (to  be  discussed 
in  Chapter  2),  PQ  has  an  interesting  interpretation;  these  realizations 
are  the  internal  ones  (i.e.  those  which  can  be  constructed  in  terms  of 
the  given  process  without  introducing  exogeneous  noise) . 

The  minimum  P*  and  the  maximum  P*  are  of  particular  interest.  The 
following  matrix  Riccati  equations,  given  in  (11],  may  be  used  to 
calculate  them. 

Proposition  1.2.  Let  (II(t);  teZ+)  and  {n (t) ;  t  £  Z+)  be  the  solutions 
of  the  n  x  n-matrix  difference  equations 

n(t+l)  -  net)  *  A(H(t))  ;  lt(0)  *  0  ,  (1.16a) 

n(t+i)  -  ft(t)  «  A(iict))  ;  !i(o)  *  o  (i.i6b) 

respectively,  where  A  is  given  by  (1.15)  and  A  by 
A(P)  *  -P  +  F'PF  +  (H»  -  F’PG) (J  +  J»  -  G'PG)_1(H*  -  F'PG)\  (1.16c) 
Then  H(t)  •*  P*  and  H(t)"1  +  P*  as  t  +  °°  . 

Remark.  Equation  (1.16a)  has  an  immediate  stochastic  interpretation  in 
terms  of  the  Kalman  filter.  Consider  an  arbitrary  realization  of  the 
form  (1.14)  with  state  covariance  P.  The  linear  least  squares  estimate 
&(t)  of  x(t)  given  the  data  {y(0),  y (1) , . . .  ,y(t-l)}  is  generated  by 
the  Kalman  filter 

S(t+1)  =  Fx(t)  +  K(t)R(t)-3s[y(t)  -  H£(t)]  ;  x(0)  *  0  ,  (1.17) 

where  K  is  the  Kalman  gain  and  is  given  by 
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K  -  [FZH*  +  R(P)%]R_Js  ,  (1.18b) 

R  -  H2H»  +  R(P)  (1.18c) 

and  l  is  the  error  covariance  Z(t)  *  E{[x(t)  -  fc(t)][x(t)  -  fc(t))'} 
which  satisfies  the  matrix  Riccati  difference  equation 

£(t+l)  «  FE(t)F’  -  K(t)  K(t) '  +  BB’  ;  E(d)  -  P  .  (1.18d) 

Set  HCt) :  »  E{x(t)x(t) Then  II(t)  *  P  -  E(t)  which  inserted  in  (1.1 8d) 
implies  that  II  satisfies  (1.16a)  and  that 

K  «  (G  -  FnH')R(H)-,S.  (1.1 8e) 

Hence,  by  the  above  proposition  II (t)  -*■  P*  and  K(t)  B*  as  t  ■♦*  »,  i.e. 

P*  and  B*  can  be  regarded  as  the  state  covariance  and  the  gain  of  the 
steady-state  Kalman- Bucy  filter. 


In  introducing  the  continuous-time  version  of  the  stochastic  reali¬ 
zation  problem,  we  shall  closely  follow  the  presentation  in  [2],  Let 
(y(t) i  t  e  R}  be  a  mean-square  and  purely  nondeterministic  m-dimensional 
stochastic  process  with  stationary  increments  and  zero  mean.  Then  there 


r  elh>t-l 

L  iw 


am 


exists  an  orthogonal  stochastic  measure  d?  such  that  y(t) 
and  E{d£((i>)  d?(w)+}  *  <t(iw)dw.  (Here  +  denotes  conjugation  and  trans¬ 
position.)  The  m  x  m-matrix  of  real  functions  is  the  spectral  density 
satisfying  (i)  each  element  of  $  is  analytic  on  the  imaginary  axis, 

(ii)  $>(s)  =  $(-s)',  (iii)  $(iw)  s  0  for  all  w  e  ]R  and  (iv)  $(»)  < 
Furthermore,  $  is  assumed  to  enjoy  the  additional  properties  that 
R  :  ■  $(»)  is  positive  definite  (the  singular  case  will  be  studied  in 


Chapter  3)  and  that  $(iw)  >  0  for  all  w  e  R.  Analogously  to  the 
discrete-time  case,  $  can  be  v:ritten 
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*(s)  -  Z(s)  +  Z(-s) '  ,  (1. 19) 

where  Z  is  a  positive  real  function  [8].  Let  [F,  G,  H,  R]  be  a  minimal 
realization  [4]  of  Z.  ThenReU(F)}  <0,  (F,  G)  is  controllable,  and 
(H,  F)  is  observable. 

Now,  the  problem  is  to  find  all  representations  of  the  type 

dx  *  AX  dt  +  Bdw 
dy  *  Cx  dt  +  Ddw  , 

such  that  the  output  y  has  spectral  density  4>  ,n  *  dim  A  minimal  and 
Re{X(A)}'  <  0. 


Modulo  a  trivial  coordinate  transformation  in  the  state  space,  all 
solutions  to  this  problem  are  of  the  form 


dx  *  F  -X  dt  +  Bjdu  +  B2dv 

(1.20a) 

w 

dy  »  H  x  dt  +  R  du  , 

(1,20b) 

where  B  *  (B^B^  and  P  »  E{x(t)  x(t) '}  satisfy  the  Positive 

Real  Lemma 

Equations 

FP  +  PF’  +  B1B1'  +  B2B2‘  *  0 

(1.21a) 

G  »  PH'  +  B^*5 

(1.21b) 

P  *  P'  >  0  (n  x  n-matrix)  . 

(1.21c) 

Let  P  =  (P  |  P  solves  (1.21)}.  For  each  P  £  P,  define 

A(P) 

a  FP  +  PF'  +  (G  -  PH')R"1(G  -  PH')'  . 

(1.22) 

Then  P  *  (P  *  P1  >  0  |  A(P)  £  0}  [2] .  Let  PQ  *  {PeP  |  A(P)  ■  0>.  Then 
Proposition  1.1  holds  for  this  setting  also  [11].  Moreover,  in  analogy 
with  the  discrete-time  case,  this  problem  can  be  seen  to  be  equivalent 
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to  finding  all  minimal  stable  spectral  factors  of  the  type 

W(s)  W(-s) '  «  *(s)  .  (1.23) 

The  continuous -time  counterpart  of  Proposition  1.2  is 

Proposition  1.3.  Let  H  and  S  be  the  unique  solutions  of  the 
differential  equations 

net)  *  A(n(t))  ,  n(o)  *  o 

and 

n(t)  *  A(ii(t))  ,  ii(Q)  «  o 

respectively ,  where  A  is  given  by  (1.22)  and  A  by 

A(P)  »  F*P  +  PF  ♦  (H*  -  PG)R-1 (H*  -  PG)'  . 

Then  H(t)  ■+■  P*  and  n(t)-1  +  P*  aa  t  +  «. 

Remark.  As  was  remarked  (after  Proposition  1.2),  equation  (1.24a)  has 
the  interpretation  that  II (t)  *  E{X(t)  fc(t)’},  where  £(t)  is  the 
linear  least  squares  estimate  of  the  state  process  x(t)  of  the  system 
(1.20)  given  the  record  (y(s);  0  £  s  £  t}  which  is  generated  by  the 
Kalman-Bucy  filter 

dx  =  F  k  dt  +  K(t)R”^[dy  -  H  i  dt];  x(G)  =  0  , 
where  the  Kalman  gain  K  is  given  by 


n  *  n-matrix 

(1.24a) 

(1.24b) 


K  =  (G  -  IIH^R"^  . 


(1.24c) 
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1.2,  Structure  of  the  Set  P:  Continuous-Time 


Since  each  wide  sense  stochastic  realization  is  determined  by  its 
covariance  matrix  P,  an  investigation  of  the  structure  of  the  set  P 
of  all  such  matrices  is  deemed  necessary.  In  this  section,  we  shall 
exploit  the  role  played  by  a  Hamiltonian  matrix  to  be  defined  below  to 
provide  some  new  links  between  the  solutions  of  what  is  known  as  the 
Algebraic  Riccati  Equation  (ARE)  (the  solution  set  of  which  is  Pq)  and 
those  of  a  Quadratric  Matrix  Inequality  (QMI)  with  solution  set  P.  The 
boundary  and  extreme  points  of  P  will  be  studied. 

Let  P+  »  {P  e  P  |  P  >  P*}  and  P_  *  {P  e  P  |  P  <  P*},  where  P*  and 
P*  are  the  minimum  and  maximum  elements  of  P  defined  in  Section  1.1. 

Since  <t(iw)  >  0  for  all  real  o>,P*  -  P#  >  0  [12;  p.  360],  and  consequently 
P+  and  P_  are  both  nonempty.  For  each  P  e  P,  define  the  feedback  matrix 


r  -  F  -  (G  -  PH’)R  H  . 


(1.25) 


Let  the  feedback  matrices  corresponding  to  P*  and  P*  be  denoted  T*  and  Y* 
respectively.  It  .can  be  shown  that  Re{X(F*)}  <  0  and  Re{X(r*)}  >  0 
(12;  p.  360],  [11;  p.  53].  Finally,  from  the  given  matrices  F,  G,  H  and 
R,  construct  the  2n  x  2n-matrix 


(F  -  GrA)*  -  H’R_1H 


GR’V 


(F  -  GrA) 


(1.26) 


(the  significance  of  which  will  be  clear  shortly.)  It  is  trivial  to  see 


that  F  is  a  Hamiltonian  matrix  i.e.  F  *  TFI * ,  where  I  * 


”°  A 
LA  °J 


Consequently,  if  X^  ;  i  *  l,2,...,n  is  an  eigenvalue  of  F,  so  is  -  X^ 


[13] .  It  can  also  be  shown  that  F  must  have  no  purely  imaginary  eigen¬ 
values  [14]. 

In  the  following  proposition,  we  collect  some  facts  from  Brockett 
[4] ,  Faurre  [11] ,  MacFarlane  [14] ,  Martensson  [IS]  and  Willems  [16] . 

Proposition  1.4.  There  ia  one  and  only  one  P  t  Pfl  with  Re{A(r)}  <  0 
(Re{A(r)}  >  0),  namely  P*(P*).  Moreover,  the  eigenvalues  of  the  corre¬ 
sponding  feedback  matrix  r*(r*)-  are  the  n  eigenvalues  of  F  with  negative 
(positive)  real  parts. 

The  following  lemma  will  be  needed  in  this  section. 

Lemma  1.5,  Let  P^  and  P2  be  arbitrary  elements  of  PQ  and  let  and 
r2  be  the  corresponding  feedback  matrices  (1.25).  Then 

I^AP  +  APiy  •  0  ,  (1.27) 

where  AP  *  P1  -  P^. 

Proof:  Since  P^  and  P2  belong  to  PQ,  A(Pp  *  0  and  A(P2)  3  0.  Sub¬ 
tracting  the  second  from  the  first  and  adding  and  subtracting  the 
quantity  P^H,R'*1HP2,  we  obtain  (1.27).  □ 

As  a  first  corollary  to  the  above,  we  can  easily  prove  some  of  the 
statements  of  the  previous  proposition. 

Corollary  1.6*  The  feedback  matrices  T*  and  -T*'  are  similar.  ( Conse¬ 
quently ,  if  A^  ;  i  *  l,2,...,n  arc.  the  eigenvalues  of  then 
-A.^  ;  i  3  l,2,...,n  are  the  eigenvalues  of  T*.J 


Proof.  Since  4(iu)  >  0  for  all  u  «  R,  P*  -  P*  >  0.  By  the  above  lemma, 
r*(P*  -  P*)  ♦  (P*  -  P*)r*'  *  0.  Hence  F*  -  -(P*  -  Pjr*»(P*  -  P*)"1.  □ 

Now,  we  turn  our  attention  to  the  other  solutions  of  the  Algebraic 
Riccati  Equation  :  A(P)  >0 

For  an  arbitrary  n  x  n-matrix  M  with  n*  eigenvalues  with  positive 
real  parts  and  n”  eigenvalues  with  negative  real  parts,  let  L*(M)  and 
L~(M)  denote  the  invariant  subspaces  spanned  by  the  corresponding 
(generalized)  eigenvectors . 

Lemma  1.7  (J.  C.  Willems  [16]). 
feedback  matrix  (1.2S).  Then 

Ta  *  r*a  , 
and 

rb  -  T*b  , 

The  following  corollary  is 
lemma. 

Corollary  1.8,  Let  P  and  r  be  as  in  Lemma  1.7.  Then  the  eigenvalues 
of  T  are  among  those  of  and  F*  fi.e.  among  those  of  '•  (In  parti¬ 
cular  t  the  feedback  matrix  corresponding  to  any  solution  P  e  Pq  has  no 
purely  imaginary  eigenvalues. ) 

The  above  corollary  is  in  agreement  with  the  well  known  result  of 
Potter  [17]  (generalized  in  [15]  to  the  case  of  nondistinct  eigenvalues) 
that  all  solutions  of  the  (ARE)  may  be  obtained  from  the  eigenvectors 
corresponding  to  the  eigenvalues  of  the  Hamiltonian  matrix  F. 


Let  P  e  ?Q  and  T  the  corresponding 


for  a  e  L"(r) 


(1.28a) 


for  b  e  L  (T) 


(1.28b) 


a  trivial  consequence  of  the  above 
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The  next  corollary,  which  holds  under  a  natural  and  standard 

assumption  that  F  can  be  transformed  to  Jordan  form,  provides  some  more 

information  about  the  feedback  matrix  T  corresponding  to  any  solution 

P  c  P  . 
o 

Corollary  1.9.  Assume  the  Hamiltonian  matrix  F  has  distinct  eigenvalues . 
Let  X  be  cat  eigenvalue  of  the  feedback  matrix  T  corresponding  to  an 
arbitrary  element  P  of  ?  .  Then  -X  cannot  be  an  eigenvalue  of  Y. 

Proof.  Let  [F,  B,  H,  R*]  be  the  (unique)  realization  corresponding  to 
P  (since  P  €  PQ  ,  ^  *  °) ,  which  gives  rise  to  the  spectral  factor 

W(s)"1-  »  -R"Sl(sI  -  T)”1  BjR **  +  R-**  , 

which  can  be  written  —  ^--^(s) ,  where  Xp  is  the  characteristic  polynomial 
of  T  and  M  is  a  matrix  polynomial.  Therefore,  by  Cramer's  rule,  Xp 
equals  the  numerator  of  det  W,  and  consequently,  in  view  of  (1.23), 

Xp(s)  Xp(-s)  *  4>(s),  where  <f>  is  the  numerator  polynomial  of  det  $.  But, 
since,  in  particular,  this  relation  holds  for  r  *  T*  and  since 
Xp  C-s)  »  Xp*Cs)  (Corollarly  1.6),  <j>  must  be  the  characteristic  poly¬ 
nomial  Xp  of  F  (Proposition  1.4),  i.e. 

Xp(s)  Xp(-s)  =  Xp(s). 

Now  suppose  X  and  -X  are  eigenvalues  of  T.  The  (s  -  X)(s  +  X)  is  a 

2  2 

factor  of  both  Xp(s)  and  Xp(-s).  Consequently,  (s  -  X)  (s  +  X)  is  a 
factor  of  Xp(s) >  which  is  clearly  a  contradiction  to  the  assumption  that 
F  has  distinct  eigenvalues.  0 
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In  particular,  we  are  able  to  see  that  the  sets 
P£  «  {P  «  PQ  j  P  >  P*}  and  P~  «{P  €  ?Q  |  P*  <  P*}  are  singletons. 

Corollary  1.10.  An  n  *  n  symetrio-  matrix  P  belongs  to  P*  (P~  )  if  and 
only  if  the  feedback  matrix  V  corresponding  to  P  is  similar  to  -TJ  (T#) . 

Proof.  Let  P  c  P+  .  Then  P  >  P*.  Hence,  by  (1.27), 
o 

T*-(p  -  P*)r^  (P  -  P#)”*.  Conversely,  if  V  is  similar  to  -i'J  , 
then  T  has  all  eigenvalues  in  the  right  half  plane.  But,  by  Proposition 
1.4,  there  is  only  one  such  feedback  matrix,  which  is  T*:  the  one 
corresponding  to  P*.  By  the  assumption  $(iw)  >  0,  P*  ••  P*  >  0  i.e. 

P*  e  P+  .  The  proof  of  the  other  part  is)  analogous .  □ 

Indeed,  Corollary  1.10  may  be  reformulated  as:  P+  *  {P*>  and 

p;  -  {p.>. 

Next,  we  shall  discuss  the  relationships  between  the  solutions  of 
(ARE)  :*A(P)  *  0  and  (QMI)  :  A(P)  £  0. 

For  any  e  >  0  and  any  matrix  M,  define  the  ball  U(M,e)  *  {L  :  L  * 

M  +  N,  || N  ||  <  e),  where  |j  •  ||  is  the  usual  matrix  norm  associated  to  the 
vector  Euclidean  norm. 

Definition  1.11.  An  n  x  n  syrametrix  matrix  P  belongs  to  the  boundary  of 
P  (denoted  by  3P)  if,  for  all  e  >  0,  there  exist  two  matrices  P^  and  P2 
belonging  to  U(P,e)  such  that  P^  €  P  and  P2  fc  P.  (Since  P  is  closed, 

3P  c  p.) 
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The  following  theorem,  which  provides  a  complete  characterization 
of  the  boundary  points  of  P,  is  due  to  Germain  [18]. 


Theorem  1.12*  Let  P  e  V  and  set  - 0  «  FP  P  F»  and  S  •G-PH'. 

O  ^  0  0  0  0 

ft  Sc 


Than  P  e  3P  if  and  only  if  the  matrix  14  »  ei 
o  o  S’ 

_c 


is  singular. 


As  a  corollary,  we  obtain  the  following  result  linking  PQ  with  P. 
A  sharper  result  will  be  given  later  in  this  section. 


Corollary  1.13.  PQ  e  a p. 


Proof.  Let  Pq  fi  PQ.  Then  A(PQ)  *  0,  consequently,  the  matrix  Mq  (see 
Theorem  1.12)  is  singular.  □ 

In  fact,  if  m  <  n  (which  is  usually  the  case  in  application),  we 
have  a  stronger  result,  namely 


Corollary  1.14.  Let  m  <  n  and  let  and  P2  be  arbitrary  elements  of 
PQ.  Then  the  segment  [P^,  P^]  is  contained  in  3P.  (In  particular , 

[P*,  P*]  <=  3P.) 


Proof.  Let  a  e  [0,1]  and  define  P(a)  =ap1  +  (i  -a)P  .  Then  A(P(a)) 
may  be  written 

A(P(o))  *  oA(P1)  +  (1  -  a)A(P2)  -  a(l  -  a) (?1  -  P^H'R'Wj.  -  P2).  (1.29) 

Since  P^  P2  e  P  ,  A(Pp  =  A(P2)  =0.  Let  Q(a)  «  -FP(a)  -  P(a)F»  and 
S(a)  3  g  -  P(a)H*.  Then  it  is  easy  to  see  that 
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rA(P(cO  -  Q(a)  -  S(a)R"1S(a) •  *  ot(l  -  a)(?1  -  P2)  H»R"1H(P1  -  P2). 

.1  lQ(oi)  S(af| 

If  m  <  n,  H'R  H  is  not  full  rank  and  hence  the  matric  M(a)  «L^t  R 

is  singular.  Then  P(a)  e  3P  Vas  [0,1].  □ 

The  final  task  of  this  section  is  to  prove  that  solutions  of  the 
CARE)  are  extreme  points  of  the  set  of  solutions  of  the  (QMI) .  This  is 
a  much  stronger  result  than  Corollary  1.13.  (The  extreme  points  of  a 
set  are  contained  in  its  boundary.)  To  this  end,  we  shall  need  the 
following  lemma. 

Lenina  1.15.  Let  P  be  an  arbitrary  element  of  Pq  .  Suppose  there 
exist,  two  elements  P^  and  P2  belonging  to  P  such  that 
P  »  aP^  +  (1  -  a)P2  for  some  a  e  (0,1).  Then,  Pj^  e  P  ,  P2  e  Pq  and 
APH'R~1HAP:*  0,  where  AP  »  PJ  -  P2. 

Proof.  Let  P,  Pj^  and  P2  be  as  in  the  lemma.  Then,  by  (1.29), 

A(P)  -  oAfPp  +  (1  -  a)A(P2)-  a(l  -  a)APH’R-1HAP.  Since  a  e  (0,1),  the 

last  term  is  S  Q.  On  the  other  hand,  since  A(P)  =  0  (for  P  e  P  ) , 

A(P^)  ^  0  and  A(P2)  s  0  (for  both  belong  to  P),  the  last  term  must 
be  2  0.  As  a  >  0,  APH’R~*HAP  *  0.  Consequently,  A(Pp  *  A(P2)  »  0. 
which  implies  Pj  and  P2  belong  to  PQ.  □ 

The  above  lemma  says  in  essence  that  elements  of  PQ  cannot  be  written 

as  the  convex  combination  of  elements  other  than  those  of  P  . 

0 

The  next  theorem  is  the  main  result  of  this  section.  The  basic 
idea  of  its  proof  was  suggested  to  us  by  Professor  D.  Sorensen. 
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Theorem  1.16.  Let  P  e  P  .  Thmn  P  ta  cm  extreme  point  0/  P. 

Proof.  Let  P  €  PQ  and  assume  there  exist  P^  and  P2  «  P  such  that 
P  *  olPj^  +  (1  -  a)P2  for  a  e  (0,1).  We  shall  show  that  P  ■  Pj  *  Pj. 

By  Lenina  1.15,  e  PQ,  P2  «  PQ  and  APH’R^HAP  -  0.  The  last  of  these 
facts  implies  AP^R”1*  *  0,  i.e.  P^R’**  -  PjH’R"1*.  This  in  turn  implies 
that  (G  -  PjH'IR’1^  -  P^')*  *  (G  -  P2H')R“1(G  -  PjH’)':  -  E.  However, 
P^  and  P2  are  in  PQ  implies  FP^  +  PjF'  *  -E  »  FP2  +  P2F* .  Hence 
FAP  ♦  APF*  *  0.  But  F  is  a  stability  matrix  i.e.  ReA(F)‘  <  0.  Then  F 
and  -F'  have  no  eigenvalue  in  common,  which  implies  AP  ■  0  (see  e.g. 

[19]) .  Hence  *  p2  *  ?•  Q 

If  the  eigenvalues  of  F  were  distinct,  the  above  result  nay  alter¬ 
natively  be  proved  by  the  following. 

Proposition  1.17.  Let  Pj^  and  ?2  be  two,  elements  of  ?q  such  that 
APH'R”1HAP  »  0,  where  AP  •  P1  -  Pr  Then,  »  V2  and 

T2AP  +  APrj  -  0  ,  (1.30) 

where  and  V2  are  the  feedback  matrices  corresponding  to  P^  and  ?2 
respectively. 

Proof.  Recall  that  ^  »F  •  GR_1H  +  P^H’R'Hl.  As  was  indicated  in  the 
proof  of  Theorem  1.16,  APH'R'1HAP  >  0  implies  P^R-*5  *  P^'R"1*  . 

Then  T.  »  V  .  The  rest  of  the  result  then  follows  by  (1.27).  0 

X  A 

Therefore,  if  the  eigenvalues  of  F  are  distinct,  so  are  those  of 
(Corollary  1.8).  Then,  by  Corollary  1.9,  ?2  has  no  opposite  eigenvalues, 
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and  consequently  AP  in  (1.30)  will  be  zero  [19].  Hence,  in  view  of 
Lemma  1.15  any  P  e  PQ  is  an  exteme  point  of  P. 

Of  course,  if  a  *  n,  Theorem  1.16  would  follow  trivially  from 
Lemma  1.15  since  then  H'R~*H  is  full  rank  and  hence  the  condition 
APH' R-1HAP  »  0  implies  AP  =  0. 

1.3  Structure  of  the  Set  P:  Discrete-Time 

In  this  section,  we  shall  present  the  discrete-time  versions  of  some 
of  the  results  of  the  previous  section;  the  purpose  being  to  facilitate 
easy  comparisons.  At  times,  we  shall  need  to  resort  to  a  well-known 
equivalence  between  dynamic  systems  in  continuous-  and  discrete-time 
[8,  11,  18]  to  prove  some  of  these  results.  Further  properties  of  the 
set  P  in  this  discrete  setting  will  be  studied  in  Section  1.6. 

Let  P+  and  P_  be  defined  as  in  the  previous  section.  Here  again, 
since  $(eiw)  >  0  for  all  real  w,  P*  -  P*  >  0  [18].  Analogously  with 
(1 . 25) ,  in  this  case  we  define  the  feedback  matrix  to  be 

r  *  F  -  (G  -  FPH,)R(P)"1H.  (1.31) 

The  feedback  matrices  T*  and  T*  corresponding  to  P*  and  P*  satisfy 
the  properties:  |X(r*)|'  <  1  and  |X(r*)|  >  1  [11].  Furthermore,  it  can 
be  shown  [9]  that  $>0*0  >0  implies  that  T*  is  nonsingular. 

The  following  is  a  slight  modification  of  a  result  in  [18] .  It 
will  be  useful  in  what  follows.  Our  proof  will  shed  more  light  on  the 
structure  of  P.  Unlike  [18],  we  do  not  rely  upon  the  corresponding 


continuous -time  result. 
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Lenina  1.18  (a)  Set  R*  -  J  ♦  J*  -  «P*H’ .  Let  P  «  P+  and  set 

M*  «  (P  -  P*)'1.  27i«n  M*  eatieifiee 

-M#  +  r*M*r#  ♦  H’R^H  +  N  -  0  (1.32a) 


/ox*  aorta  nonnegative  definite  matrix  N. 

(b)  set  R*  -  J  +  J'  -  HP^H».  Let  ?  and  aa*M* 
M*  satisfies 

m*  -  .r*Vr*  +  H*R*_1H  +  N  *  0 


(P*  -  P)"\  Then 
(1.32b) 


for  some  nonnegative  definite  matrix  N. 


Proof,  (a)  Let  P  <  P+.  In  view  of  (1.12)  and  (1.16a),  we  have 
P  •  FPF’  ♦  (G  -  FPH')R(P)”1 (G  -  FPH')’  +  B^',  and 
P*  »  FP*F'  +  (G  -  FP^HOR^G  "  FP#H*) '  . 

Upon  subtracting  the  second  of  these  two  relations  from  the  first,  the 
following  is  obtained 

m;1-  ♦  kr(p)k'  -  m„k;  *  b2b2’  ,  (1-33) 

where  It  and  K,  are  defined  by  KRCP)  •  G  -  FPH’  and  K,R,  >  G  -  PP.H' .  It 
la  easy  to  see  that  R,  -  R(P)  ♦  HM^H'  and  that  K  >  [K„R,  -  PM*3H'  ]R(P)  1. 
Let  «  -  It  -  K..  Then  Alt  •  (K.H  -  F)M;lH'R(P)_1  and  K  >  K,  ♦  Alt  and 

(1.33)  becomes 

M*1  -  FM;1F,  +  (K#  +  AK)R(P)(K*  +  AK)  *  -  K*(R(P)  +  HM,lH')K;  +  * 

After  long,  but  simple  calculation,  we  get 

m;1  -  r,M;1r;  *  r,M;1H-RCP)'lnM;lr;  ♦  s28' 

or 

m;1  s  r*  [m;1  +  m;1h’r(p)“1hm;1ii’;  . 
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Taking  the  inverse  of  both  sides ,  we  obtain 

from  which,  the  following  relation  is  obtained 

m.  i  rl'Hi.r;1  -  r^H-R^Hr;1  , 

using  the  matrix  inversion  lemma 

[A  +  BD^C]"1  «  A”1  -  A~*B[D  ♦  CA‘1B]“1CA'1  .  (1.34) 

t 

Then,  premultiply  the  last  inequality  by  and  postmultiply  by  r„ 
to  get 

r>.r.  i  m.  -  h-r^h  , 

which  yields  (1.32a)  for  some  nonnegative  definite  matrix  N.  This 
proves  (a) .  The  proof  of  (b)  is  analogous .  □ 

As  a  first  application  of  this  leimna,  in  the  next  theorem,  we 
give  a  parametric  representation  for  the  set  P.  The  formula¬ 
tion  of  the  result  is  analogous  to  that  in  [2]  for  the  continuous -time 
case. 

Theorem  1.19  Let  M*(N)  and  M*(N)  be  the  solutions  of  (1.32a)  and  (1.32b) 
respectively ,  Then 

(a)  the  matrix  P  »  P*  +  [M*(N)]”*  belongs  to  P+  if  and  only  if 
N  0, 

(b)  the  matrix  P  »  P*  -  [M*(N)]-1  belongs  to  if  and  only  if 
N  i  0,  and 

(c)  P*  -  P*  *  [M*(0)]-1  *  [M^O)]"1. 
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Proof,  (a)  We  have  to  prove  the  "if”  part;  the  ”only  if”  was  proved  in 
Lemma  1.18.  Let  M*(N)  be  a  solution  of  (1.32a)  with  N  S:  0.  The  pair 
(T*,  H)  is  observable  for  (F,  H)  is  [20].  Recalling  that  T*  is  a 
stability  matrix,  a  standard  result  in  stability  theory  (see  e.g.[13; 
p.  86]1  implies  M*(N)  >  0.  Consequently  P*  +  [M*(N)]_1  €  P+.  The  proof 
of  (b)  is  analogous  and  that  of  (c)  is  immediate.  □ 

In  addition  to  its  significance  in  parametrizing  the  set  P,  Theorem 

* 

1.19,  together  with  Proposition  1.2  provide  us  with  a  procedure  to  gene¬ 
rate  stochastic  realizations  of  y  corresponding  to  an  arbitrary  element 
P  £  P+  uP_:  First  use  (1.16a)  to  compute  P*;  P*  will  be  obtained  from 
Theorem  1.19(c)  and  varying  N  over  the  nonnegative  cone  will  generate 

k 

the  other  elements  of  P+  u  P_.  The  realization  [F,  B,  H,  (R(P)  ,0)] 
corresponding  to  P  e  P+  P_  can  be  computed  via 

Bx  -  (G  -  FPH')R(P) (1.35a) 

B2B2*  -  -A(P)  .  (1.35b) 

This  procedure  for  generating  stochastic  realizations  requires  solv¬ 
ing  a  matrix  Riccati  equation  (1.16a)  in  order  to  determine  P*  in 
addition  to  the  burden  of  determining  P  e  P+  u  In  Section  1.6 
another  procedure  that  eliminates  the  intermediate  step  of  computing 
P  will  be  given. 

The  second  aspect  of  the  structure  of  P  that  will  be  discussed  in 
this  section  is  its  boundary. 

As  was  pointed  out  in  Proposition  1.1,  P  is  bounded.  A  complete 
characterization  of  the  boundary  points  of  the  corresponding  set  P  in 
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the  continuous -time  setting  was  given  in  Theorem  1.12.  Here,  by 
associating  the  discrete-time  quadruplet  [F,  G,  H,  J]  with  a  continuous¬ 
time  one  [F  ,  G  ,  H  ,  R  ](an  idea  that  is  well  known  [8,  11,  18]),  we 
shall  give  complete  characterization  of  these  boundary  points.  We  shall 
also  gain  more  insight  into  the  relationship  between  the  set  P  and  the 

set  P  . 
o 

In  order  to  distinguish  between  the  continuous  and  discrete  settings, 
we  shall  adjoin  the  letter  c(d)  (as  a  subscript  or  superscript)  to  the 
matrices  and  sets  of  the  continuous  (discrete)  setting  whenever  there  is 
a  need  for  distinction. 

Definition  1.20.  The  discrete-time  quadruplet  [F,  G,  H,  J]  and  the 

£ 

continuous -time  one  [F  ,  G  ,  H  ,  R  ]  axe  said  to  be  equivalent  if  P  *  P  . 

v  C  C  C 

The  following  proposition  shows  how  to  construct  a  quadruplet  in 
one  setting  from  one  in  the  other  setting.  The  proof  can  be  found  in 
[11]. 

Proposition  1.21.  Every  diearete-time  quadruplet  [F,  G,  H,  J]  ia  equi¬ 
valent  to  a  oontinuoua-time  one  [F.  G.  H  ,  R  ],  where 

c  c  c  c 


F  :  =  (F  +  I)'1(F  -  I),  (1.36a) 

c 

G  :  =*  ST  (F.  +  I)_1G,  (1.36b) 

c 

Hc  :  *  /T  H(F  +  I)"1,  (1.36c) 

R  :  =  J  +  J'  -  H(F  +  I) -1G  -  G'(F»  +  I)_1H'  (1.36d) 

c 


Conversely ,  eveiry  oontinuoua-time  quadruplet  (F,  G,  H,  R]  is  equivalent 
to  a  di8orete-time  one  [F^,  G^,  H^,  J^],  where 


■JSSKBSSSSSBR* 
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Fd 

:  -  (I  -  F)~ 

L(I  ♦  F), 

(1.37a) 

cd 

:  *  /T  (I  - 

F)'XG, 

(1.37b) 

Hd 

:  -  /T  H(I 

-  F)'1, 

(1.37c) 

Jd 

:-i-R  ♦  H(I 

-  F)~XG  . 

(1.37d) 

Now,  we  are  ready  to  state  the  discrete-time  version  of  Theorem  1.12. 

Theorem  1.22.  Let  P  e  P  and  set  Q  *  P  -  FP  P»  and  S  »  G  -  FP  H?. 

O  o  O  0  O  0 

Then  P  «  3P  if  and  only  if  the  matrix  M  * 
o  o 

Proof.  Let  the  quadruplet  [F  ,  G  ,  H  ,  R  ]  be  the  one  given  by  (1.36) 

c  c  c  c 

corresponding  to  [F,  G,  H,  J],  Let  *  (•)  be  its  corresponding  spectral 

v 

density.  It  is  not  hard  to  see  that  $(eXlU)  >  0  for  all  o»  e  R  implies 

$c(i<D)  >  0;  hence  R£  >  0.  By  Theorem  1.12,  PQ  e  3f^  if  and  only  if  the 
OS 

matrix  M  »  Fyc  c]  is  singular.  However,  it  is  easily  seen  that  [11] 

c  LscrJ 


X 

~ 7T  (F  +  I)"1  o” 

M 

~/T  (F *  +  I)-1  -(F  +  I)_1Hr 

-H(F  +  I)"1  I 

0 

0  I 

Hence  M  and  M  have  the  same  rank  and  one  is  singular  if  and  only  if  the 
0  o 

other  is .  □ 

Corollary  1.23.  PQ  =  3P 

Proof.  If  P  £  P  ,  then  M  in  Theorem  1.22  is  signular.  □ 

0  0  0 


Qo 

So 


R(P0) 


ia  singular. 
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Corollary  1.24.  Let  ai  <n  and  let  P^  and  P2  be  arbitrary  elements  of 
PQ.  Then  the  segment  [P^  P2]  c  3P.  (in  particular  [P#,P*]  =  3PJ 

Proof.  It  is  not  hard  to  see  that  P  «  P  if  and  only  if  p  e  Pc.  Hence. 

o  o 

by  Corollary  1.14,  P(ct)  »  aPj  +  (1  -  a)P2  «  3PC  «  3P,  for  all 

a  «  [0,1].  □ 

Theorem  1.25.  Let  P  €  P  .  Then  P  is  an  extreme  point  of  P. 

Proof.  By  Theorem  1.16,  P  is  an  extreme  point  of  Pc  »  P.  □ 

1.4  A  Hamiltonian  Approach  to  the  Factorization  of  the  Matrix  Riccati 
Differential  Equation 

Consider  the  matrix  Riccati  differential  equation 

P  -  A(P)  ;  P(0)  «  PQ  (1.38) 

which  is  of  the  type  encountered  in  Section  1.1.  For  convenience,  we 
recall  that 

AC?)  :  =  FP  +  PF»  +  (G  -  PH')R'1(G  -  PH’)’, 

where  the  quadruplet  [F,  G,  H,  RJ  is.  a  minimal  realization  of  the 
positive  real  matrix  function  Z(s)  defined  by  (1.19'J.  Hence  (1.38)  has 
a  unique  bounded  solution  on  [Q,00)  for  all  PQ  s  P*  [18].  In  this  section, 
we  shall  present  a  new  approach  for  factorizing  the  above  Riccati  equation 
based  on  Pontryagin’s  Maximum  Principle.  In  this  way,  we  shall  obtain 
a  new  derivation  of  the  non-Riccati  algorithms  due  to  Kailath-  [21]  and 
Lindquist  [22-24] .  Our  derivation  will  shed  more  light  on  these 
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algorithms  and  will  provide  new  links  to  the  Hamiltonian  formulation  of 
(1.38).  The  basic  idea  of  this  procedure  (which  was  indirectly  suggested 

to  us  by  Professor  L.  E.  Zachrisson)  is  to  consider  the  above  Riccati 

/ 

equation  as  arising  from  an  optimal  control  problem,  with  which  a 
Hamiltonian  function  is  associated.  The  factorization  will  then  be  a 
direct  consequence  of  the  fact  that  the  Hamiltonian  function  is  constant 
along  the  optimal  trajectory.  Another  procedure  based  on  the  Hamiltonian 
formulation  can  be  found  in  Bucy  and  Joseph  [25] . 

It  is  worth  noting  that  (1.38)  is  not  the  most  general  type  of 
Riccati  equations  that  one  might  encounter  (e.g.  one  might  have  an  extra 
constant  term  added  to  the  right  hand  side  of  (1.38)).  However,  our  aim 
here  is  to  convey  the  basic  ideas  of  the  method,  and  we  are  therefore 
using  the  form  (1.38)  which  arises  in  stochastic  realization  theory. 


Consider  the  following  control  problem.  Find  a  square  integrable 
control  function  u(*)  so  as  to  minimize 

JCujt^a)  *  -  yx'tojPQ  x(0)  +  T  [u (t)  'Ru(t)  +  2  x(t)'G  u(t)]dt, 

(1.39) 


subject  to 


x(t)  «  -F’x(t)  -  H’u(t)  ;  x(tx)  *  a  (1.40) 

Note  that  since  Z(s)  is  positive  real,  the  function  J(u;.,.)  is  bounded 
from  below  [8;  pp.  231-232].  Hence  5  •  *  inf  J(u;.,.)  >  Let 
(u^  ;  k  €  Z+)  (where  Z+*»  {0,1,2,...})  be  a  control  sequence  such  that 
J(u^;.,.)  +  5  as  k  ■*■*.  Then,  using  the  parallelgram  identity  in  Hilbert 
space  and  a  completeness  argument,  it  is  seen  that  there  is  a  control  u° 
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such  that  J(u0;.,.)  ■  5.  In  fact,  this  can  also  be  seen  from  the 
proof  of 

Proposition  1.26.  Thar*  exists  a  unique  square  inte gratis  function 
u°  minimising  J(u;t^,a).  Moreover t  J(u°;t^,a)  ■*  -~*'P(t^)a,  where  P 
is  the  unique  solution  of  (1.38). 

Proof.  Consider  the  function  y  x(t) 'P(t)  x(t).  Upon  differentiating 
this  quantity  and  integrating  between  0  and  t^,  we  obtain 

JCujt^a)  a'PCt^a-i  x(0)'(P(0)  -  PQ]x(0)  jV(t)  -  A(P(t))] 

+  i^tl|!u(t)-*-R‘1G'x(t)-R”1HPCt)x(t)  ||Rdt, 

where  A(P)  is  given  by  (1.22)  and  j|x||R  »  x'Rx.  Then  the  result  follows 
by  noting  that  P  satisfies  (1.38)  and  that  it  can  be  choosen  to  make  the 
last  term  zero.  □ 

The  optimal  control  u°  can  be  obtained  as  a  corollary  to  this  proposition. 
However,  since  the  Hamiltonian  function  will  play  an  important  role  in 
what  follows,  we  shall  instead  apply  the  Maximum  Principle.  First,  let 
x°  be  the  solution  of  (1.40)  corresponding  to  u°.  To  apply  the  Pontryagin 
Maximum  Principle,  define  the  Hamiltonian  function 

H(t,x(t),u(t),y(t))  a  y  u(t)  'Ru(t)  +  x*  (t)Gu(t) 

+  y(t)'[-F»x(t)  -  H'u(t)].  (1.41) 

Then  the  Maximum  Principle  requires  that  for  a  control  u°  to  be  optimal , 
H(t,x°(t)  ,u°(t)  ,y(t))  must-  be  minimal  i.e. 

(t,x°(t) ,u°(t) ,y(t))  »  0  »  Ru°(t)  +  G’x°(t)  -  Hy(t)  ,  (1.42a) 
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where  the  adjoint  function  y  is  given  by 

(t,x°(t)  ,u°(t)  ,y(t))  «-y  -  Gu°(t)  -  Fy(t) 


(1.42b) 


with  initial  condition  y(0)  *  Pox°(0).  Hence  the  optiaal  control  is 


u°(t)  -  -R"1[G,x°(t)  -  Hy(t)]. 


(1.43) 


Using  (1.40)  ,  (1.42)  and  (1.43) ,  it  is  easy  to  see  that  the  2n-vector 

P-O-T 


“x°“ 

_y  _ 


satisfies 


r*°i  rx°i 

rx°(tji 

“a 

i — 

i — 

N 

*n 

1 - 

*< 

1 _ 

w» 

1 — 

O 

v—/ 

•- 

1 _ 

'  o 

LPo*  c°)  J 

(1.44) 

where  F  is  defined  by  (1.26).  Then 


x  (t)  -  X(t)xc 
y(t)  *  y  (t)  x 


(1.45a) 

(1.45b) 


where  xq  »  x°(0)  and  X  and  Y  are  n  *  n  matrix  functions  satisfying 


"xi  rxi  rxcor 

.  * F  ; 

-YJ  LYJ  L_y(0)_ 


(1.45c) 


We  recall  the  following  well-known  fact: 

Proposition  1,27.  Let  X  and  Y  be  aa  in  (1.45).  Then  X(t)  is  non - 
singular  for  all  t  and  the  matrix  function  P(t)  :  »  Y(t)  X(t)"1 
is  the  unique  solution  of  (1.38). 

As  a  corollary  to  the  above  proposition  and  using  (1.45a,b),  it 
can  be  seen  that 


y(t)  »  P(t)  x  (t) . 


(1.46) 
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Now,  let  the  function  H°  :  [0,^]  +R  be  defined  by 

H°(t)  :  *  «(t,x°Ct)  ,u° (t)  ,y(t) )  .  (1.47) 

Lemma  1.28.  Let  H°  be  defined  by  (1.47).  Then 

H°(t)  •  -  ix°(t)^(t)x°(t).  (1.48) 

Moreover ,  (t)  «  0,  i.e.  H  is  oonetant  along  the  optimal  trajectory. 

Proof.  Clearly,  for  each  t  e  [0,t^],  the  problem  to  minimize  J(u;t,x°(t)) 
has  the  optimal  solution  (u°(s);s  e  [0,t]};  we  shall  misuse  notations 
somewhat  by  calling  this  restricted  function  u°  also.  Then,  by  Proposi¬ 
tion  1.26,  J(u°;t,x°(t))  x°(t)*P(t)x°(t) .  Hence,  since 

H°(t)  *  4^  (u°;t,x°(t))  +  y't)'x°(t),  where  y(t)  is  given  by  (1.46), 

gi/> 

(1.48)  follows.  The  fact  that  (t)  *  0  follows  from  elementary 
calculus.  □ 

Lemma  1.29.  Let  H°  be  defined  by  (1.47).  Then 

H°(t)  =  -ix^  M(t)x0  ,  (1.49) 

where  xq  3  x°(0)  and  the  n  x  n  matrix  function  M  is  defined  by 

M(t)  :  =  X(t)’P(t)X(t).  (1.50) 


Proof.  The  result  is  an  immediate  consequence  of  (1.45a)  and  (1.48)  □ 


34 


Lemma  1.30.  Let  M  at id  A  be  given  by  (1*50)  and  (1.22)  respectively.  Then 

M(t)  *  A(Pq)  Cl, 51) 

( i.e .  M  ie  constant.). 

Proof.  Using  (1.44)  and  (1.46),  it  is  easy  to  see  that  x°  »  -T’x0, 
where  T  is  the  feedback  matrix  (1.25).  Let  Hf  be  the  transition  matrix 
of  T.  Then  xQ  »  HffO.^a,  Hence,  H°(t)  »  a'f (0,^) 'M(t)f(0,t.)a, 
which  is  constant  for  all  a  e  Rn.  Consequently,  f  (0,tp ’M(t)f  (0,tj)  is 
a  constant  matrix,  hence  the  same  is  true  for  M.  But,  by  definition 
M(0)  *  P(0),  which,  in  view  of  (1.38)  is  the  same  as  A(Pq) ;  and  conse¬ 
quently  Cl .51)  follows.  □ 

Lemma  1.31.  Let  X  be  as  in  (1 . 45) .  Then 

^  (X(t)T1  =  r(t)  (XCt) *)“1, 
where  V  is  given  by  (1.25). 

Proof.  Using  (X*’1)  =  -X,'1XX^"1,  (1.45)  and  Y(t)  =  P(t)X(t),  the 
result  follows .  □ 

We  are  now  ready  to  state  the  main  result.  First  observe  that, 
since  the  n  x  n-matrix  A(Pq)  is  symmetric,  there  exist  two  constant 
matrices  N  and  S  such  that  A(Pq)  =  NSN’ ,  where  N  is  n  k  r,  S  is  r  x  r 
and  r  is  the  rank  of  A(Pq) .  ( For  example,  S  can  be  choosen  as  the 

signature  matrix  although  we  shall  use  a  different  S  below.) 


i 
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Theorem  1.32  ([21]).  Let  P  ie  the  unique  solution  of  (1.38).  Then 


P(t)  -  Q(t) SQ(t) 

;  P(0)  -  P0 

(1.52a) 

Act)  « rct)Q(t) 

;  Q(0)  -  n, 

(1.52b) 

where  Q(t)  »  (X’(t))”*N  and  N  and  S  are  given  as  above. 

Proof.  From  (1.50)  and  (1.51),  we  have  t  =  X'^AfP  )X-1,  which  by  the 
preceding  discussion  and  Lemma  1.31,  is  (1 . 52) .  □ 

As  an  application  of  the  factorization  (1.52),  consider  the  problem 
of  determining  the  Kalman  gain  K  given  by  (1 . 24c) ,  where  It  is  the 

solution  of  (1.24a),  in  which  case  *  0  and  A(0)  *  GR_1G'.  Hence 

_y. 

choosing  N  *  GR  *  and  S  *  I  in  (1.52),  we  obtain  the  following  non- 
Riccati  algorithm  for  K 


K  »  -QQ'H'R"*5  ;  K(0)  *  GR-*5 

(1.53a) 

Q  »  (F  -  KR-J5H)Q  ;  Q(0)  *  GR-*5, 

(1.53b) 

which  was  first  obtained  independently  by  Kailath  [21] ,  who  used  the 
factorization  above,  and  Lindquist  [22],  who  derived  it  from  basic 
principles  using  backward  innovations. 

As  another  application  of  interest  in  realization  theory,  consider 

the  case  where  P  is  any  element  of  P  as  defined  in  Section  1.1.  In 
o 

that  case  A(PQ)  *  -  B2B’,  wbere  B2  *s  g*ven  (l*20a),  and  we  may  choose 
N  and  S  to  be  and  -I  respectively  to  obtain 

P  *  -  QQ*  ;  P(0)  «  PQ 

i 

.Q  *  TQ  ;  Q(P)  =  B2. 


(1.54a) 

(1.54b) 
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Then  it  can  be  shovm  [2]  that  P(t)  e  P  for  each  t  e  R  and  therefore 
P(t)  is  the  state  covariance  of  a  realization  for  which  (in  view  of  (1.21)) 

Bj  »  (G  -  P(t)H')R“,S  and  B2  «  Q(t)  . 

Hence,  W6  have  the  following  non-Riccati  algorithm  generating  a  family  of 
realizations 

Bj  -  B2B»H’R")s  ;  B^O)  «  (B^  (l.S4c) 

B2  *  (F  -  B1R"JsH)B2  ;  B2  CO)  *  (Bq)2  (1.54d) 

for  a  given  initial  matrix  Bq  *  [(B^)Q, (B2)Q] .  (Note  the  parameter  t  is 
not  time  now.) 

This  algorithm  was  first  presented  in  [2].  In  Section  1.6,  we  are 
going  to  derive  its  discrete- time  version,  which  is,  as  expected,  more 
complicated.  Also,  in  Chapter  3,  we  shall  derive  the  nonstationary 
version  of  (1.S4). 

1.5.  A  Hamiltonian  Approach  to  the  Factorization  of  the  Matrix  Riccati 
Difference  Equation 

In  this  section,  the  matrix  Riccati  difference  equation 

P(f  +  1)  =  FP(t)F»  +  (G  -  FP(t)H’)R(t)-1(G  -  FP(t)H')  ’ ;  P(0)  *  P  .  (1.55) 

[which  is  the  same  as  P(t  +  1)  -  P(t)  =  A(P(t)),  where  A  is  given  by 
(1.15)]  will  be  considered,  where  the  quadruplet  [F,  G,  H,  J]  is  a  mini¬ 
mal  realization  of  S(z)  defined  by  (1.9)  and  R(t)  =  J  +  J’  -  HP(t)H* . 

The  aim  is  to  obtain  a  factorization  of  this  matrix  Riccati  equation, 
analogous  to  the  one  obtained  in  the  previous  section.  This  will  not 
only  facilitate  comparisons  with  the  continuous  setting,  but  will  also 
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be  the  basis  for  the  next  section.  Moreover,  as  will  be  seen  shortly, 
the  lack  of  symmetry  between  the  two  settings  will  again  be  illustrated. 

Before  stating  the  control  problem  which  gives  rise  to  the  above 
Riccati  equation,  we  note  that  the  matrix  T  :  ■  J  +  J’  is  nonsingular 
since  T  ■  R(t)  +  HP(t)H* ,  and  R(t)  is  assumed  to  be  positive  definite 
for  all  t.  Again  the  problem  is  to  find  a  control  u(.)  which  minimizes 

*l"l 

J(u;t.,a)  «4x(0)'P  x(0)  +  \  \  [u(t  +  l)'Tu(t  ♦  1) 

1  4  °  t«0 

+  2x(t  +  l)'Gu(t  +  1)],  Cl*  56) 

subject  to 

x(t)  »  F'x(t  +  1)  +  H'u(t  +  1)  ;  x(t  )  »  a.  #  (1.57) 

As  in  the  continuous-time  setting,  the  assumption  that  S(z)  is  posi¬ 
tive  real  insures  the  boundedness  of  the  functional  J  and  the  existence 
of  the  optimal  control  u°.  Also,  using  an  argument  similar  to  that  of 
Proposition  1.26,  it  is  not  hard  to  check  that 

J(u°;t,x°(t))  »  -  j  x° (t) P (t) x° (t) ,  (1.58) 

where  x°  is  the  solution  of  (1.57)  corresponding  to  u°  and  J(u;t,x(t)l  is 
the  value  function  defined  by 

i  i  t‘1 

J(u;t,x(t))  -4x(0)'P  x(0)  +  j  l  [u (k  +  l)'Tu(k  ♦  1) 
z  0  *  k*0 

+  2x(k  +  l)'Gx(k  +  1)].  (1.59) 

As  in  Section  1.4  we  aTe  misusing  notations  somewhat  by  denoting  u° 
restricted  to  [0 , t]  u°  also.  Again,  the  optimal  control  u°  can  be  obtained 
from  the  derivation  of  (1.58);  however,  we  shall  resort  to  the  Hamil- 
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tonian.  To  exploit  the  analog/  with  the  continuous-time  problem,  we 
shall  use  the  Maximum  Principle  of  [26]  with  the  Hamiltonian  function 


H(t,x(t  ♦  1),  u(t  ♦  1),  y(t))  -  |  u(t  +  l)'Tu(t  +  1)  ♦  x»(t  ♦  l)Gu(t  ♦  1) 

+  y(t)»[x(t  +  1)  -  F'x(t  +  1)  -  H«u(t  +  1)], 

■  (1.60) 

where  y(t)  -  y(t  +  1)  ■  (t,x°(t  +  1),  u°(t  +  1),  y(t)),  i.e. 

y(t  ♦  1)  -  Fy(t)  -  G  u°(t  ♦  1)  ;  y(0)  -  Pox°(0),  (1.61a) 

u°  and  x°  are  as  above.  Note  that  then 

X°(t  ♦  1)  -  x°(t)  -  5^  (t,x°(t  ♦  1),  u°(t  ♦  1),  y(t)) . 

• 

Hence,  with  this  formulation,  there  is  a  complete  analogy  between  the 
discrete-  and  the  continuous -time  settings  just  exchanging  derivatives 
for  differences. 

Now  the  Maximum  Principle  states  that  H(t,x°(t  +  1),  u°(t  +  1),  y(t)) 
has  a  minimum  for  u  *  u°(t)  i.e.  by  differentiation 

Tu°(t  +  1)  +  G ' x° (t  +  1)  -  Hy(t)  »  0. 

which  implies 

u°(t  +  1)  -  -  ♦  1)  -  Hy(t)] .  (1.61b) 
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where  the  matrix  F  is  given  by 

‘A.’  H'T_1H 

A 


GT  G 


(1.63a) 


and 


A  *  F  -  GT-1H. 


Then,  as  in  the  continuous -time  setting 


x  (t)  »  X(t)x„ 


y(t)  -  Y(t)x 


(1.63b) 

(1.64a) 

(1.64b) 


where  xq  *  x  (0)  and  X  and  Y  are  n  *  n  matrix  functions  satisfying 


A'X(t  +  1)  *  X(t)  -  H,T-1HY(t) 


X(0)  *  I 


(1.65a) 


Y(t  +  1)  *  AY(t)  +  GT_iG’X(t  +  3)  ;  Y(0)  »  PQ.  (1.65b) 


To  exploit  the  analogy  with  the  continuous-time  setting,  the  follow¬ 
ing  lemma  is  needed. 

Lemma  1.33.  Let  P  be  any  n  x  n  symmetric  matrix  and  let  R  :  =  T  -  HPH' 
be  nonaingular.  Then ,  the  matrix  [I  -  H'T~*HP]  is  full  rank  and  its 
inverse  is  [I  +  H'R^HP]. 


Proof.  It  is  easy  to  check  that 

[I  -  H’T-1HP]  [I  +  H’R^HP]  *  I. 


Hence,  the  two  matrices  in  the  left  hand  side  are  full  rank.  0 
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X 

Lemma  1.34.  Let  [y]  be  the  solution  of  (1.65)  acrid  let  P  be  the  solution 
of  the  Riooccti  equation  (1.S5).  Then  Y(t)  »  P(t)X(t). 

Proof.  Replace  Y  by  PX  in  (1.65b)  to  obtain 

P(t  +  l)X(t  +  1)  »  AP(t)x(t)  +  GT"1G*X(t  +  1) 

Then,  use  Lemma  1.33  to  obtain  X(t)  from  (1.65a)  with  Y(t)  set  equal 

to  P(t)X(t).  Then  inserting  this  into  the  above  equation,  we  obtain 

* 

[P(t  +  1)  -  P(t)  -  A(P(t))]X(t  +  1)  «  0 
which,  in  view  of  (1.55)  is  an  identity.  Hence  the  lemma  follows.  □ 

As'  a  first  corollary  to  the  above  two  lemmas,  we  have 

Corollary  1.35.  Let  X  and  P  be  as  in  Lemma  1.34.  Then  the  matrix  X(t) 
is  nonsingular  for  all  t  e  Z+.  Moreover ,  the  matrix  A  is  nonsingular . 

Proof.  From  (1.65a),  we  have  A*X(1)  *  [I  -  H'T^HP  ] ,  which  by 
Lemma  1.33  is  full  rank  since  R(0)  :  *  T  -  HPqH*  is  nonsingular.  Hence 
A  and  X(l)  are  also.  The  result  now  follows  by  repeating  this  argument 
for  t=l,2, . ...  0 

As  another  corollary,  we  obtain  the  counterpart  of  Proposition  1.27. 

Y 

Proposition  1.36.  Let  [y]  be  the  solution  of  the  system 


X(t+1)“ 

"  A*"1  -A' -1H,T-1H 

"X(t)“ 

"x(0)“ 

_i 

Y (t+1) _ 

S 

GT*1G,A'"1  A-GT^G'A^i'T^H 

_Y(t)_ 

• 

> 

_Y(0)  _ 

as 

p 

1-  o  — * 

(1.66a) 
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Then  P(t)  :  *  Y(t)X(t)”*  is  the  solution  of  (l.SS)  and 

yCt)  «  P(t)x°(t).  (1.66b) 

In  analogy  with  the  continuous-time  setting ,  define  the  n  x  n-matrix 

M(t)  *  X(t  +  l)’6Pt+1X(t  +  1)  (1.67) 

where  X(t)  is  given  by  (1.64)  and  (1.66a)  and  <SPt+l  :  «  P(t  +  1)  -  P(t). 
Just  as  in  Section  1.4,  we  want  to  express  the  optimal  sequence 
{H°(t);  t  *  0,1,..., tj}  defined  by 

H°(t)  :  *  H(t,x°(t  +  1),  u°(t  +  1),  y(t))  (1.68) 

in  terms  of  M. 

Proposition  1.37.  Let  H°  and  M  be  given  by  (1.68)  and  (1.67)  respectively. 
Then 

H°(t)  *-jx^M(t)xo  -  j  5x°'(t  +  l)P(t)6x°(t  +  1)  (1.69) 

where  x°  *  x°(0)  and  6x°(t  +  1)  *  x°(t  +  1)  -  x°(t). 

Proof.  It  is  easy  to  see  that 

H°(t)  *  J(u°;t  +  l,x°(t  +  1))- J(u°;t,x°(t))  +y(t)'  (x°(t  +  1)  -  x°(t)), 

where  J(u;t,x(t))  is  given  by  (1.59).  Remember  that  u°  restricted  to 
[0,t]  minimizes  J(u;t,x°(t))  .  Hence,  by  (1.58)  and  (1.66b) 

H° (t)  =  jx°(t)'P(t)x°(t)  -  j  x°(t  +  l)'P(t  +  l)x°(t  +  1)  -  x°(t)'P(t)x°(t) 
+  x°(t)'P(t)x°(t  +  1), 

which,  upon  adding  and  subtracting  the  quantity  j  x°(t  +  l)'P(t)x°(t  +  1) 
and  using  (1.64)  and  (1.67),  yields  (1.69).  0 
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However,  unlike  the  continuous-time  setting,  the  sequence  H°(t)  is 
not  constant,  nor  is  the  matrix  function  M.  We  shall  now  use  the  Riccati 
equation  (1.55)  to  get  an  alternative  expression  for  the  Hamiltonian 
sequence  H° (t) . 

Lemma  1.38.  Let  if3.  and  M  be  given  by  (1.68)  and  (1.67).  Then 

H°(t)  «-jx'[M(t  -  1)  -  M(t  -  l)X(t)_1H’R(t  -  l)'1H(X(t)’)'1M(t  -  l)]xj 

-  j5x°(t  +  1) *P(t)6xW(t  +  1).  (1.70) 

Proof.  First,  using  (1.60),  we  may  write 

H°(t)  *  ju°(t  +  l)'Tu°(t  +  1)  +  x°(t  +  I)'Gu0(t  +  1)  -  y(t)  'x°(t) 

+  y(t)'x°(t  +  1). 

Upon  inserting  (1.61b)  and  (1.66b)  into  this  equation,  we- obtain 

H°( t)  m.k  x°(t)'P(t)x°(t)  -  +  1) ,GT~1G,x°(t  +  1) 

+  ix0(t)tP(t)H'T"1HP(t)'X0(t)-2-x0(t)'P(t)x0(t)  +  x°  (t)  *  P  (t)  x°  (t  +  1). 

Next,  it  is  not  hard  to  check  that  the  Riccati  equation  (1.55)  can  be 
reformulated  to  Tead 

P(t)  -  A[P(t  -  1)  +  P(t  -  1)  HfR(t  -  1)-1HP (t  -  1)]A'  -  GT^G', 

where  A  is  given  by  U.63b).  Inserting  this  value  for  GT-1o'  into  the 
above,  we  get 

H°(t)  *-jX°(t)'P(t)x°(t)  +  |x°(t  +  l)»A[P(t  -  1) 

+  P(t  -  l)H'R(t  -  l)“1HP(t)']A,x°(t  +  1)'  +  j  x°(t)'P(t)H'T"1HP(t)x°(t) 

-  j 6 x° (t  +  l)'P(t)6x°(t  +  1). 
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Finally,  it  is  not  difficult  to  check,  using  (1.61b)  and  (1.57)  that 

x°(t  +  1)  -  A»’1[I  -  H*T“1HP(t)]x0(t)  (1.71) 

which  inserted  into  the  above  equation  for  H° (t)  yields  (1 . 70) .  0 

The  above  lemma,  together  with  Proposition  1.37,  provide  us  with 
a  recursion  for  M. 

Lemma  1.39.  Let  M  be  given  by  (1.67).  Then  M  satisfies 
M(t)  «  M(t  -  1)  -  M(t  -  l)X(t)  —1H* R(t  -  l)"*1H(X(t)  ')"*1M(t  -  1).  (1.72) 

Proof.  Using  the  same  argument  as  in  the  proof  of  Lemma  1.31,  this 
follows  from  the  fact  that  T(t)  is  nonsingular  for  all  t,  which  follows 
from  Lemma  1.40  below.  0 

Before  we  state  the  main  results,  we  shall  need  the  following 

Lemma  1.40.  Let  X(t)  be  as  in  (1.64)  and  (1.65).  Then 

(X(t  +  l)')"1  *  r(t) (X(t)')"1,  (1.73) 

where  V  is  the  feedback  matrix  (1.31). 

Proof.  Upon  applying  the  matrix  inversion  lemma  (1.34)  to  (1.65a),  in 
view  of  Proposition  1.36,  one  obtains 

(X (t  +  l)')"1  *  [F  +  FP(t)H'R(t)-1H  -  G(T_1  +  T-1HP (t) U'R(t)-1)H] (X(t) ') 


(1.74) 
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The  last  term  in  (1.74)  can  be  written 

-G(T-1R(t)  +  T“1HP(t)H»)R(t)“1H(X(t),)‘1, 
which  is  -GR(t) ~^H(X(t) ’ ) -1 .  0 

Analogously  to  the  continuous -time  case,  let  r  :  *  rank  M(0) ;  then 
M(0)  «  X(l)'6PjX(l)  can  be  written  NSN',  where  N  is  n  x  r  and  S  is  r  x  r. 

Theorem  1.41.  Let  -CP Ct) ;  t  e  Z+}  be  the  solution  of  (1.55)  and  let  N 
and  S  be  as  above .  Then  P  can  be  determined  from  the  system 

P(t  +  1)  -  P(t)  -  Q(t)Z(t)Q(t)  ’  ;  P(0)  *  Pq  (1.75a) 

where  the  matrix  sequences  (Q(t) ;  t  e  Z+},  {Z  (t) ;  t  e  Z+>  are  generated 

by 

Q(t  +  1)  »  [F  -  U(t  +  l)R(t  +  l)_1H]Q(t)  ;  Q(0)  «  T(0)N  (1.75b) 

U(t  +  1)  *  U(t)  +  FQWZ^QCtrH'  ;  U(0)  «  G  -  FP0H»  (1.75c) 

t 

R(t  +  1)  »  R(t)  +  HQ(t)Z (t)Q(t) *H'  ;  R(0)  »  J  +  J’  -  HPqH*  (1.75d) 

Z(t  +  1)  *  Z(t)  +  Z(t)Q(t)  ,H,R(t)”1HQ(t)Z(t)  ;  Z(0)  *  -S.  (1.75e) 

Proof.  Let  Q(t) :  =  (X(t  +  l)’)"^  and  U(t) :  a  G  -  FP(t)H» .  Then  by  (1.73)  , 
Q(t  +  1)  a  F(t  +l)Q(t)  which,  by  (1.31)  and  the  definition  of  U,  yields 
(1.75b).  Next,  in  view  of  (1.72)  and  the  fact  that  M(0)  =  NSN',  it  can 
be  easily  seen  that  M(t)  =  -NZ(t)N’.  Then  (1.75a)  follows  from  (1.67). 
Finally,  (1.75c)  and  (1.75d)  follow  from  (1.75a)  and  the  definitions  of 
U  and  R.  □ 

As  an  application  of  the  factorization  (1.75),  consider  the  problem 
of  determining  the  Kalman  gain  K  given  by  (1.18e).  In  this  case  IIo  *  0 
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and  A(0)  »  GT^G'.  Hence,  we  nay  choose  N  «  X,(l)GT"Js  and  S  »  I,  in 
which  case, 

K(t)  «  U(t)R(t)_J*  ;  K(0)  «  GT-*5  ,  (1.76) 

where  U(t),  R(t)  are  given  by  (1.75)  with  initial  conditions 
Q(0)  «  GR’1*,  U(0)  «  G,  R(0)  -  T  and  Z(0)  -  -I. 

This  version  of  the  algorithm  is  the  one  originally  presented  by 
Lindquist  [27,  83].  The  general  case  can  be  found  in  [29],  where  the 
following  factorization  was  used 

6Pt+l  3  “  «PtH'R(t  -  l)-1H*Pt]r(t)\  (1.77) 

Relation  (1.77)  can  be  obtained  from  (1.72)  by  inserting  (1.67)  and  not¬ 
ing  that  (X(t  +  l)»)"1X(t)'  =  T(t). 

1.6  Non-Riccati  Algorithms  Inside  the  Set  B. 

Each  P  e  P  can  be  interpreted  as  the  state  covariance  matrix  (1.2) 
of  the  corresponding  realization  (1.14)  [5],  Consequently,  there  is  a 
minimum-variance  (P*)  and  a  maximum-variance  (P*)  realization  for  each 
of  which  B2  *  0. 

Faurre’s  Algorithms  (1.16)  show  that  the  solutions II (t)  and  n (t) 
converge  to  P*  and  P*  respectively  as  t  -*■  “.  However,  these  solutions 
start  outside  the  set  P  (for  0  P) .  In  this  section,  P*  and  P*  will  be 
approached  from  inside  P.  In  particular,  for  a  given  PQ  e  P_  (Pq  e  P+) , 
we  shall  construct  a  trajectory  extending  from  PQ  to  P*  (from  Po  to  P*) 
so  that  this  trajectory  is  a  totally  ordered  set  of  matrices  satisfying 
(1.12).  Such  a  result  will  enable  us  to  construct  a  countable  family  of 
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realizations  of  y,  the  state  covariances  of  which  are  totally  ordered, 
yielding  a  procedure  to  obtain  a  countable  family  of  realizations  without 
resort  to  the  intermediate  step  of  determining  the  auxilliary  quantity  P. 
Indeed,  the  (non-Riccati)  factorization  of  the  previous  section  will  be 
the  basis  of  this  procedure. 

The  work  presented  in  the  rest  of  this  chapter  is  the  discrete-time 
version  of  Section  6  in  [2] .  The  procedure  is  more  complicated  than  its 
continuous -time  counterpart.  However,  this  is  natural  and  is  largely 
due  to  the  fact  that  the  matrix  R(P)  depends  on  P,  while  its  counterpart 
in  the  continuous  case  does  not. 

First,  let  us  start  with  the  following 

Lemma  1.42.  Let  A(P)  be  defined  by  (1.15).  Then,  for  each  Pq  e 
the  solution  (P(i);  i  e  Z+)  of  the  matrix  difference  equation 

P(i  +  1)  -  P(i)  =  A(P(i))  ;  P(0)  *  Po 

satisfies  (i)  P(i)  VP  for  all  i  e  Z+,  (ii)  P(i2)  5  Pfi^)  for  i^ 
and  (iii)  if  PQ  c  P  ,  P(i)  •+■  P*  as  i  •+■  «. 

Proof.  Since  Pc',  A(P  )  «  -  B„Bi  [see  (1.35)].  Then  P(i)  satisfies 
0  0  ^  * 

(1.75)  with  N  *  X(1)’B2  and  S  *  -I.  Then  2(0)  *  I  and  consequently  it 
follows  that  Z(i)  *  0  for  all  i  c  Z+.  Therefore,  in  view  of  (1.75a), 
6P^+1  *  A(P(i))  s  0.  Hence  P(i)  c  P  for  all  i  c  Z+.  This  proves  (i) 
and  (ii) . 

To  prove  (iii),  let  =  P*  -  P(i).  An  argument  similar  to  that 
used  in  proving  Lemma  1.18  yields  that 


P, 

(1.78) 
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m.  .  »  rnt,  r*'  -  r*M.H,R?1ml  r** ;  m  «  p*  -  p  . 

1+1  1  X  1  1  0  o 

Since  Mq  >  0  (for  Po  £  P_)  and  Mi+1  -  Mi  *  0  by  (1.75a),  >  0 

V  i  c  Z+.  Consequently  M7*  exists.  Let  M*(N)  be  as  defined  in 
Theorem  1.19.  Define  V.  :  -  M*(0)  -  MT1.  Then  V.  ,  -  V.  «  -  (M?\  -  Mj1). 
By  (1 . 34) ,  we  have 

mtJj  *  r*»‘1M:1r*"1  +  -  HMiH*)”1HT*~1 

=  r*»“1M‘1r*"1  +  r*,‘1H'R*"1Hr*"1. 

Consequently, 

v4x1  -v.  *  mT1  -  r*’_1M:1r*"1  -  r*,"1H,R*"1Hr*"1. 

1+1  11  1 

But  by  (1.32b),  M*(0)  satisfies 

-M* (o)  +  r*»’1M*(o)r*'1  +  r*»‘1H,R*'‘1Hr*‘1  » o. 

Therefore 

V  -V  .  «  -V .  +  r+^V.r*’1  ;  i  e  Z+. 

l+l  i  i  i  . 

Since  ^{(r*)"1)!  <  1 ,  V  ^  •+  0  as  i-+»  and  consequently 
Md  *+  [M* (0)]”1  »  P*  -  P*.  Hence,  P(i)  +  P*  as  i  +  ».  □ 

Now,  we  are  ready  to  state  the  first  main  result  of  this  section: 
the  non-Riccati  algorithm.  Since  the  realizations  are  determined  by  the 

matrix  B,  the  algorithm  will  be  given  in  terms  of  this  parameter. 

i 

Let  B  oe  the  set  of  all  B  »  (B^Bj)  given  by  (1.35)  with  P  e  P. 

Let  Bq,  B_  and  8+  be  defined  analogously  in  terms  of  PQ,  P_  and  P+. 

It  is  clear  that  Bq  *  {B  c  B  ]  B£  *  0} .  In  particular,  let  B*  and  B* 
denote  those  elements  of  8o  corresponding  to  P*  and  P*  respectively. 
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Theorem  1.43.  Let  [F,  Bq,  H,  (R*,  0)]  be  cm  arbitrary  realization  of 
y ,  and ,  for  each  i  e  Z+,  let  B(i)  =  [B^(i),  B2(i)]  be  given  by 

B1(i)  =  UCi)R(i)-Js  (1.79a) 

B2(i)  =  Q(i)Z(iA  (1.79b) 

where  the  matrix  sequences  U(i) ,  Q(i),  Z (i)  and  R(i)  are  generated  by 
(1.75)  with  initial  conditions  U(0)  *  (B0)iRo>  Ro  “  Ro*  *  ^Bo^2 

and  Z(0)  *  I.  For  each  i  e  Z+  let  P(i)  be  the  solution  of 

-  P  f  FPF'  +  B(i)B(i)»  =  0  (1.80) 

Then j  for  all  i  e  1  ,  [F,  B(i),  H,  (R  (i) ,  0]  is  a  realization  of  y, 
with  state  covariance  P(i).  Moreover t  if  Bq  e  B_ ,  B(i)  -*■  (B*,0)  as 
i  -*- *>.  Finally t  the  sequence  (P(i) ;  i  6  Z+}  satisfies  conditions  (i)-' 
(i.ii)  of  Lemma  1.42  and  the  difference  equation 

P(i  +  1)  -  P(i)  *  -  B2(i)B2(i)’.  (1.81) 

Proof.  Let  P  be  the  state  covariance  of  the  initial  realization 
o 

[F,  Bo,  H,  (R^,  0)],  and  let  {P(i);  i  e  Z^be  the  trajectory  through  PQ 
defined  by  Lemma  1.42.  Then  P(i)  e  P  for  all  i  e  Z+.  Define  Bj(i)  and 
B2(i)  by 

Bjfi)  :  *  [G  -  FP (i)H,]R(i)"’J* 
and 

B2(i)  :  *  Q(i)Z(iA 

Then,  since  P(i  +  1)  -  P(i)  «  -  Q(i)Z(i)Q(i) » ,  (1.81)  follows.  From  the 
proof  of  Theorem  1.41  we  see  that  U(i)  *  G  -  FP(i)H*.  Hence 
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(1.79a)  follows.  Equations  (1.78)  and  (1.81)  imply  (i) B2 Ci)  *  =  -A(P(i)), 

which  together  with  the  above  definition  of  B^(i)  yield  (1.80).  Since 

|X(F)  |  <1  and  (F,  B(i))  .is  controllable  (for  (F,  Bq)  is),  the  solution 

of  (1.80)  is  unique,  symmetric  and  positive  definite.  This  fact, 

together  with  (1.80)  and  the  definition  of  B^(i)  insure  that  (P(i),  B (i) ) 

satisfies  (1.12).  Hence  [F,  B(i),  H^-CRfi)3*,  0)]  is  a  realization  of  y 

with  state  covariance  P(i).  By  Lemma  1.42,  P(i)  satisfies  conditions 

(i)  -  (iii) .  Finally,  by  the  same  lemma,  P(i)  -*■  P*  as  i  00  if  P  e  P  . 

o  - 

Hence,  if  Bo  e  8_,  B^(i)  +  B*  as  i  +  “  and,  in  view  of  (1.81), 

B2 (i) B2 (i)  •+■  0  i.e.  B2 (i)  •+•  0.  0 

Remark.  Throughout  this  section,  we  have  used  the  parameter  i  rather 
than  t  to  stress  the  fact  that  this  quantity  has  nothing  to  do  with  time. 

The  next  task  is  to  construct  a  sequence  belonging  to  set  P  which 
is  increasing  (rather  than  decreasing)  in  i  and  which  converges  to  P*. 

In  the  continuous -time  case,  this  can  be  done  using  the  same  Riccati 
equation.;  the  analogue  of  (1.78).  Here,  unfortunately,  to  achieve 
this,  we  shall  have  to  follow  an  indirect  procedure  through  a  "backward” 
approach.  To  this  end,  let  us  review  certain  facts  about  backward 
realizations . 

We  would  like  to  consider  realizations  of  y  that  evolve  backward 


in  time  of  the  form 


x(t  -  1)  *  Ax(t)  +  Bw(t) 
y(t)  «  Cx(t)  +  §w(t). 


(1.82a) 

(1.82b) 
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where  w  is  a  normalized  white  noise  sequence  such  that,  for  each  t,  w(t) 
is  uncorrelated  to  future  (rather  than  past)  values  of  x. 

Hence,  we  can  state  the  backward  wide  sense  stochastic  realization 
; problem  as  follows:  Given  the  spectral  density  4>  of  y  (or  equivalently,, 
the  quadruplet  [F,  G,  H,  J]),  determine  all  quadruplets  [A,  B,  C,  D] 
with  dim  A  *  n  minimal  and  |X(A)|  >  1  such  that  the  output  y  of  (1.82b) 
has  spectral  density  4>. 

The  problem  has  been  studied  by  Pavon  [9]  and  earlier  in  [10,  11, 
18].  We  shall  outline  some  of  the  results  of  [9,  11]  here,  since  we 
shall  need  them  below. 

The  backward  realization  problem  is  deterministic  in  nature  and  is 
equivalent  to  the  dual  spectral  factorization  problem  considered  by 
Anderson  [30]  and  Faurre  [11],  which  requires  determining  all  minimal 
unstable  factors  W(z)  of  4>(z).  Since  W(z~*)W(z)'  *  (z)  * ,  this  problem 

is  equivalent  to  finding  all  minimal  stable  factors  W(z~*)  of  <J>(z)'. 
Consequently,  we  have  reduced  the  problem  to  the  one  considered  before. 
In  fact  all  solutions  to  this  problem  are  given  by 

[A,  B,  C,  D]  *  [t'Vt,  T"1^,  B2)V,  G’T,  (R(P)%,  0)V] 

where  T  and  V  are  as  before,  B^,  B2,  P  and  R(P)  satisfy 

P  *  F’PF  +  BjBJ  +  B2BJ  (1.83a) 

H’  =  F’PG  +  Bj&CP)*5  (1.83b) 

R(P)  =  J  +  J'  -  G’PG  ,  (1.83c) 

and  P  is  an  n  *  n  symmetric  positive  definite  matrix. 
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Again,  here  it  is  no  restriction  to  take  T  «  V  «  I  i.e,  to  consider 
backward  realizations  of  the  form 

x(t  -  1)  =  F’x(t)  +  B^Ct)  +  B2v(t)  (1.84a) 

y(t)  =  G'x(t)  +  RfPj^Ct)  Cl. 84b) 

where  w  =  . 

Analogous  to  what  we  have  done  before,  let  P  be  the  set  of  all  solu¬ 
tions  of  (1.83).,  define  the  map 

A(P)  =  -  P  +  F'PF  +  (H't-  F'PG)R(P)”1  (H*  -  F’PG) ' ,  (1.85) 

and  let  Pq  *  {P  |  A(P)  =  0}.  Then  P  =  {P  |  A(P)  s  0}  and  it  has  the  same 

properties  as  P.  Hence  there  exist  two  elements  P.  and  P*  in  P  such 

*  >o 

that  P*  S  f  s  P*  for  every  P  e  P.  It  is  well  known  [11,  18]  that  P  is 
related  to  P  by  P  =  {P^1  |  P  e  P}.  Thuse  P*  *  (P*)"1  and  P*  =  (P*1) . 

This  also  explains  the  choice  of  (1.16b,  c)  by  Faurre. 

In  fact  there  is  a  one-one  correspondence  between  forward  and  back¬ 
ward  realizations.  It  was  shown  in  [9]  how  to  compute  the  backward 
elements  B  and  R(P)  from  the  knowledge  of  the  forward  ones.  In 
the  following  proposition,  for  convenience,  we  also  include  the  converse 
statement . 

Proposition  1.44  (a) ([9]).  Assume  the  quadruplet  [F,  B,  H,  (R(P)\  0)] 
solves  the  forward  problem.  Set 

1  «-P_V1B(I  -  B’P-V,  and  (1.86a) 

(RfP)^,  0)  *  [ (R(P)^,  0)  -  HF_1B] [I  -  B,P”1B]^. 

Then,  [F»  B,  G»,  (ft(P)\  0)j  solves  the  backward  one. 


(1.86b) 
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(b)  Assume  the  quadruplet  [F\  B,  G»,  (^(P)*4,  0)]  solves  the 
backward  problem .  Set 

B  ■  -  P"1F,-1B(I  -  S'?"1!) h  and  (1.86c) 

<*(P)\  0)  «  [(&(?)**,  0)  -  G'F,”1B) [I  -  l'rll)h  (1.86d) 
Then,  [F,  B,  H,  (RCP)^,  0)]  solves  the  forward  one. 

The  following  lemma  is  the  backward  counterpart  of  Lemma  1.42  and 
Theorem  1.43. 

Lenma  1.45.  (a)  Let  X(P)  be  defined  by  (1.8S).  Then,  for  each 

€  Pj  the  solution  (P(i) ;  i  e  Z+}  of  the  matrix  difference  equation 

P(i  +  1)  -  P(i)  -  A(P(i))  ;  P(0)  «  PQ  (1.87) 

satisfies  (i)  P(i)  c  P  for  all  i  e  l*  ,  (ii)  P(i2)  s  P(i2)  for 
i^  s  i2  and  (Hi)  if  PQ  «  P_,  P(i)  ■+•  P*  as  i  *►  ®. 

-  -h 

(b)  Let  [F* ,  Bq,  G’,  (R^,  0)]  be  an  arbitrary  backward  realization  of 
y,  and ,  for  each  i  c  Z+,  let  B(i)  «  [^(i),  §2(i)]  be  given  by 

Bx(i)  -  0(i)R(i)”^  (1.88a) 

S2(i)  -  Q(i)?(i),S  (1.88b) 

where  the  matrix  sequences  U(i),  5(i),  2(i)  and  R(i)  are  generated  by 
Q(i  +  1)  «  [F»-0(i  +  l)P(i  ♦  D^G'lQCi)  ;  Q(0)  »  (Bg) 2  (1.88c) 

0(i  +  1)  «  0(i)  ♦  F’5(i)2(i)5Ci)’G  0(0)  -  (So)^  (1 . 88d) 

R(i  ♦  1)  -  fc(i)  ♦  G’5(i)2(i)$(i)*G  ;  ft(0)  »  Po  (1.88e) 

2(i  +  1)  -  2(i)+2(i)5(i)  ,Gft(i)"1G,5(i)2(i)  ;  2(0)  «  I  (1.88f) 
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For  each  i  c  Z+,  let  P(i)  be  the  solution  of 

-P  +  F’PF  +  B(i)S(i)»  «  0.  (1.89) 

Then,  for  all  i  €  Z+,  [F1,  B(i),  Gf,  (RCi)^,  0 f\is  a  backward  realization 
of  y,  with  state  covariance  P(i).  Moreover ,  if  8q  t  B  ,  B(i)  -*•  (8*,  0) 
as  i  *►  •  .  The  sequence  (P(i) ;  i  e  Z+}  is  the  same  as  in  (a)  and  it  also 
satisfies  the  difference  equation 

P(i  +  1)  -  P(i)  --82(i)82(i)«  .  (1.90) 

Proof,  (a)  By  an  argument  similar  to  that  of  Theorem  1.41,  it  can  be 
shown  that 

P(i  +1)  -  P(i)  -~Q(i)2(i)$(i)»,  (1.91) 

where  Q,  2,  together  with  ft  and  0  are  given  by  (1.88).  The  rest  of  the 
proof  is  analogous  to  that  of  Leona  1.42.  .  . 

(b)  Let  PQ  be  the  state  covariance  of  the  initial  backward  reali¬ 
zation  [F1,  ftQ,  G1,  (ft^,  0)],  and  let  (P(i);  i«  Z*}be  the  trajectory 
through  PQ  defined  by  part  (a).  Then  the  proof  of  (b)  follows  upon  defining 
Bx(i)  :  -  (H*  -  F * P (i) G) ft (i) ^and  fi2(i)  :  »  QU^i)*5.  □ 

Now,  we  are  ready  to  state  the  second  main  result  of  this  section. 

It  provides  us  with  a  trajectory  inside  P  converging  to  P*  and  a  non- 
Riccati  procedure  to  generate  a  family  of  realizations,  the  state 
covariances  of  which  are  increasing. 


Theorem  1.46.  (a)  Let  P  e.  P  and  set  p  :  =  p"1.  Let  (Pfil;  i  e  Z+) 

0  0-0  v  /  * 

be  the  solution  of 

P(i  +  1)  -  P(i)  =  P(i)N(i)P(i)  ;  P(0)  «  PQ.  (1.92) 

where 

N(i)  -  Q(i)[2(i)  -  QWPWQCi)]"1^!)',  (1.93) 

end  $(i)  and  2(i)  are  given  by  (1.88).  Then  (£)  P(i)  e  F  Vi  e  Z+, 

(vO  P(i^)  £  P  Ci2^  for>  5  ^2  an<^  Ciii)  if  PQ  c  P+,  P(i)  P*  as 
i  -*■  o°. 

(b)  Let  [F,  Bq>  H  (R^,  0)]  be  an  arbitrary  realization  of  y3  and 
for  each  i  £  Z+,  let  [F',  B(i),  G* ,  (^(i^O)]  be  the  family  of  realiza¬ 
tions  of  Lemma  1.4S3  having  state  covariance  P(i).  Let 

B(i)  :  «=-P(i)"1F»'1B(i)(I  -  B(i) ,P(i)“1B(i))*S,  (1.94a) 

(RCi)35,  0)  :  *  [(R(i)  **,())  -  G,F’”1B(i)]  [I  -  B(i)  •  P (i)~XB (i) 3^.  (1.94b) 

h 

Then 3  [F,  B(i),  H»(R(i)  ,0)]  is  a  realization  of  y  with  state  covariance 
P(i)  «  P(i)”1.  Moreover 3  if  Bq  c  8+,  B(i)  (B*,  0)  as  i  ».  Finally 3 
the  sequence  {P (i) ;  i  e  Z+)  is  the  same  as  in  (a)  and  it  also  satisfies 
the  difference  equation 

P(i  ♦  1)  -  P(i)  «-B2(i)B2(i)». 

Proof,  (a)  Let  Prt  e  P.  Then  P  :  «  P"1  c  P.  Let  (P(i);  i  £  Z+)  be 
the  sequence  generated  by  (1.91)  and  (1.88)  corresponding  to  ?o.  Set 
P(i)  :  «  P(i)"1  for  each  i  £  Z+.  Then,  by  (1.91),  P(i  +  l)"1  «  P(i)'1  - 
-  Q(i)2(i)Q(i)'.  Using  (1.34),  we  obtain  (1.92)  and  (1.93).  Since 
P(i)  £  P  «  {P”1  |  P  £  P),  P(i)  £  P  for  all  i  £  Z+.  This  proves  (i). 
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By  condition  (ii)  of  Lemma  1.45,  P(i2)  s  ^(ij)  ^or  ^  ^rom 
P(i1)  £  P(i2)  for  i^  £  i^  follows.  Finally,  if  PQ  e  P+,  Then  Pq  e  P_. 
Therefore,  by  condition  (iii)  of  Lemma  1.45,  P(i)  P*  i.e. 

P(i)~^  +  P*~*  as  i  ». 

(b)  The  proof  of  this  part  is  an  immediate  consequence  of 
Proposition  1.44.  0 

Remarks.  (1)  Theorems' L 43  and  1.46  have  the  following  interpretation. 
Let 

XQ(t  +  1)  =  Fxo(t)  +  (Bo)lU(t)  +  (Bo)2v(t)  (1.95a) 

y(t)  *  Hx0(t)  +  R^u(t)  (1.95b) 

be  an  arbitrary  (wide  sense)  realization  of  y  with  state  covariance  P  . 

(a)  Let  B(i)  ■  [B(i)  ,  B2(i)]  and  R(i)  be  given  by  (1.79) 
and  (1.75).  Then  for  each  i  £  Z+, 

Xi(t  +  1)  «  Fx.(t)  +  Bj  (i)u(t)  +  B2(i)v(t)  (1.96a) 

y(t)  =  Hxi(t)  +  RCi^uCt)  (1.96b) 

is  a  realization  of  y  with  state  covariance  P(i)  *  E{x^(t)x^(t)*} 

given  by 

P(i  +  1)  -  P(i)  -  -  B2(i)B2(i) *  ;  P(0)  -  P0.  (1.97) 

Furthermore,  {P(i);  i  e  Z+)  is  a  decreasing  sequence  in  i  such  that,  if 
Bq  €  8_,  P(i)  P*  and  B(.i)  ■+■  (B*,  0)  as  i  ®,  where  B*  is  the  Kalman 
gain  of  the  steady-state  Kalman-Bucy  filter  (.1,17). 
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(b)  Let  B(i)  *  [B^i)^  (i)]  and  R(i)  be  given  by  (1.94). 

Then  (1.96)  is  a  realization  of  y  with  state  covariance  P(i)  given  by 
(1.97).  But  now,  the  sequence  {P (i) ;  i  e  Z+}  is  increasing  in  i  such 
that,  if  Bq  e  B+,  P(i)  P*  and  B(i)  (B*,  0)  as  i  °°.  In  fact,  B* 
is  the  forward  counterpart  (in  the  sense  of  Proposition  1.44)  of  the 
gain  of  the  steady-state  backward  Kalman- Bucy  filter. 

(2)  As  an  application  of  Theorems  1.43  and  1.46,  we  can  exploit 
the  equivalence  between  dynamical  systems  in  the  discrete-  and  the 
continuous -time  settings  summarized  in  Proposition  1.21,  to  construct 
a  discrete  trajectory  inside  the  "continuous"  set  P  as  defined  in 
Section  1.1  and  to  generate  families  of  realizations  for  the  continuous¬ 
time  problem  via  difference  equations  rather  than  differential  ones. 

To  this  end,  suppose  we  are  given  the  continuous-time  quadruplet 
[F,  G,  H,  R].  Let  [F^,  G^,  Hj,  Ja)  be  defined  by  (1.37).  Then,  as  we 
have  seen  in  Section  1.3,  P^  ■  P.  For  each  P  e  P  define 

AjdO  --P  *  yFJ  *  (Gd  -  FdPH')R-1(Gd  -  FjPH'V, 

where  -  H^PH^ . 

For  each  PQ  c  P,  the  solution  {P(i);  i  e  Z+}  of 

P(i  ♦  1)  -  P(i)  «  Ad(P(i))  ;  P(0)  -  P0 

satisfies  conditions  (i)  -  (iii)  of  Lemma  1.42.  Also,  the  analogue 
of  Theorem  1.43  holds  using  the  quadruplet  [F^,  G^,  H^,  J^]. 


CHAPTER  2 


SMOOTHING  FOR  LINEAR  DISCRETE-TIME  STOCHASTIC  SYSTEMS 
IN  THE  CONTEXT  OF  STOCHASTIC  REALIZATION  IHEORY 

2.1.  Introduction 

The  linear  least-squares  estimation  problem  is  of  great  impor¬ 
tance  in  stochastic  systems  theory.  The  classical  results  on  this 
subject,  which  were  started  in  frequency  domain  language  are  primarily 
due  to  Kolmogrov  [31]  and  Wiener  [32] .  However,  here  we  shall  be  con¬ 
cerned  with  the  state  space  formulations  introduced  by  Kalman  [33], 
Kalman  and  Bucy  [34],  and  others. 

The  problem  deals  with  estimating  the  state  of  a  given  system 
from  noisy  measurements.  It  can  be  classified  into  three  categories: 
given  a  past  record  of  data,  estimating  current  values  of  the  state 
(filtering),  future  values  (prediction)  and  past  values  (smoothing). 

In  this  chapter,  it  is  the  last  category  that  we  are  interested  in. 

The  smoothing  problem  has  received  considerable  attention  in  the 
literature  in  the  last  few  years  [35-51,61],  (See  the  survey  paper 
[52]  for  further  references.)  Originally,  our  interest  in  this  problem 
was  caused  by  the  well-known  two-filter  formula  due  to  Mayne  [38] 
and  Fraser  [39],  on  which  topic  a  large  number  of  papers  had  been 
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written  [40-42,  45-51],  and  which,  nevertheless,  we  think  had  not  re¬ 
ceived  a  satisfactory  stochastic  interpretation.  This  led  to  our  con¬ 
tinuous-time  papers  [64,65].  Here,  we  shall  give  the  discrete-time 
version  of  this  theory. 


Let  {x(t);  t  e  [0,T  +1]}  and  {y(t);  t  e  [0,T]}  (here  [0, T]  :* 

0,  1,  ...  T)  be  two  stochastic  processes  of  dimensions  n  and  m  respec¬ 
tively,  defined  as  the  solution  of  the  linear  stochastic  system 


(S) 


x(t  +  1) 

y(t) 


F(t)x(t)  +  B(t)w(t)  ;  x(0) 

H(t)x(t)  +  D(t)w(t)  •  y(0) 


5 

0  , 


(2.1a) 

(2.1b) 


where  w  is  a  p-dimensional  (p  £  m),  zero  mean  white  noise  sequence 
satisfying 


E{w(t)}  -  0  and  E{w(t)w(s)'}  «  16  ,  (2.2) 

and  £  is  an  n-dimensional ,  zero  mean  random  vector  with  finite  covari¬ 
ance  matrix  N  :*  E{££'}  and  uncorrelated  with  w.  The  matrix  R(t)  :« 
D(t)D(t)'  is  positive  definite  for  all  t  e  [0,T],  and  F,  B,  H  and  D 
are  time-varying  matrices  of  dimensions  compatible  with  x,  y  and  w. 
Finally,  F(t)~*  exists  for  all  t  e  [0,T], 


The  model  S  will  be  called  a  linear1  stochastic  system',  x  is  its 
state  process,  y  its  output  process  and  w  its  input  process. 

The  state  covariance  function  P(t)  :*  E{x(t)x(t)’}  clearly  satis¬ 
fies  the  Liapunov-type  equation 


P(t  +  1)  «  F(t)P(t)F(t) '  +  B(t)B(t) ’  ;  P(0)  -  N  . 


(2.3) 
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Th e  fixed-interval  smoothing  problem  can  now  be  stated  as  follows: 
for  an  arbitrary  t  c  [0,T],  find  the  linear  least-squares  estimate  x(t) 
of  x(t)  given  (y(s);  s  e  [0,T] }  i.e.,  the  wide-sense  conditional  expec¬ 
tation  [70] 

S(t)  =  E{x(t)  |  y(s)  ;  s  c  [0,T]}  .  (2.4) 

In  this  chapter,  we  shall  study  this  problem  from  a  new  angle, 
our  aim  being  to  develop  a  unified  theory  which,  we  feel,  the  literature 
is  still  lacking.  It  is  true  that  some  authors  [49]  have  attempted  to 
do  so;  nevertheless,  we  feel  those  attempts  are  not  satisfactory.  Many 
problems  of  interpretation  of  the  existing  solutions  have  remained  un¬ 
resolved.  The  approach  we  follow  to  provide  such  a  theory  employs  con¬ 
cepts  and  techniques  from  the  stoohastic  realization  theory  developed 
in  [2,9,10]  and  in  [1,53-60],  The  basic  idea  is  to  embed  the  given  model 
S  in  a  class  of  models  S  all  having  the  same  output  process  (not  only 
the  covariances  are  the  same  as  in  Chapter  1,  but  also  the  processes 
are  equal  for  each  t  a. s.)  and  the  same  Kalman-Bucy  filter.  Such  a 
representation  is  called  a  proper  [2]  stoohastic  realization  of  y  to 
distinguish  it  from  the  wide  sense  realizations  of  Chapter  !1.  The  class 
S  will  be  shown  to  contain  an  element  (S*),  which  together  with  another 
element  (S*)  which  does  not  belong  to  S  in  general,  contains  all  the 
information  on  y  needed  to  estimate  x. 

We  note  that  the  model  S  we  are  considering  is  more  general  than 
the  one  usually  encountered  in  the  literature  in  that  Bw  and  Dw  may  be 
correlated,  i.e.,  BD»  *  0.  Secondly,  as  will  be  shown  later,  one  of 
the  major  obstacles  in  developing  this  theory  is  the  fact  that  P(t)  is 
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in  general  not  positive  definite.  We  could  assume  that  the  model  S  is 
minimal  (see  Section  2.3);  however,  this  assumption  will  not  guarantee 
the  invertibility  of  P  on  the  whole  interval  [0,T],  In  our  continuous¬ 
time  papers  [64,65],  this  obstacle  was  removed  upon  imposing  the  extra 
assumption  that  the  system  matrices  are  analytic  functions,  which  implies 
that  S  is  totally  controllable.  It  is  worth  noting  that  in  the  smoothing 
literature,  conditions  to  insure  invertibility  of  P  are  either  ignored 
all  together  [61]  or  mistakenly  assumed  to  hold  on  the  entire  interval 
as  a  consequence  of  complete  controllability  (minimality)  of  S.  This 
is  clearly  incorrect  (see  e.g.  [63]).  Hence,  at  times  we  shall  apply 
the  generalized  Moore-Penrose  pseudo -inverse  P*  of  P  wherever  P  }  0, 
leading  to  certain  nontrivial  complications. 

The  organization  of  this  chapter  goes  as  follows.  Section  2.2  is 
devoted  to  preliminaries.  A  strict  sense  version  of  some  results  on 
backward  representations  developed  in  Section  1.6  is  presented.  Our 
results  are  generalizations  of  those  of  [9]  obtained  in  the  stationary 
setting.  In  Section  2.3,  the  stochastic  realization  theory  concepts 
will  be  developed  and  in  Section  2.4,  we  present  a  discuss5.on  about 
the  frame  space,  the  importance  of  which  is  that  it  contains  the  smooth¬ 
ing  estimate.  Section  2.5  will  be  devoted  to  the  model  S*  mentioned 
above.  In  Section  2.6,  we  give  a  general  formula  for  the  smoothing 
estimate  in  terms  of  S*  and  S*.  Section  2.7  is  devoted  to  deriving 
some  previously  known  formulas  for  the  smoothing  estimate  and  to  inter¬ 
pret  them  in  terms  of  our  realization  theory. 


2.2.  Preliminaries 


Let  H  be  the  space  of  all  centered  stochastic  variables  with 
finite  second  order  moments;  H  is  a  Hilbert  space  when  endowed  with 
the  inner  product  (£,n)  *  E(5n}.  If  u  is  a  p-dimensional  stochastic 
vector  process  with  components  in  H,  define  H(u)  to  be  the  subspace 
spanned  by  (Uj,  U£,  . ..,  u^}.  Then,  for  each  stochastic  vector  process 
(z(t);  t  e  [t  ,tj]}  and  t  £  [tQ,tj],  define  Ht(z)  to  be  H(z(t)); 

H(z),  H~(z)  and  H*(z)  will  denote  the  closed  linear  hulls  in  H  of  all 

t  L 

subspaces  Hs(z)  such  that  s  e  [t0»tj],  [tQ,t]  and  [t,tj]  respectively. 
Given  ri  e  H  and  a  subspace  c  H,  E{ri  |  H^}  will  be  the  orthogonal 
projection  of  n  onto  H^,  i.e.,  the  wide  sense  conditional  mean  [70]. 

We  shall  write  E(n  |  u)  in  place  of  §{n  |  H(u)}.  The  process  n  is 
called  a  wide  sense  Markov  process  if  E(n(t)  |  H~(ti)}  *  E{Ti(t)  |  0(s)} 
for  s  s  t  or  equivalently,  E{ti(s)  j  H*(r))}  *  E{ti(s)  j  n(t)},  i.e., 
the  Markov  property  is  independent  of  the  time  direction. 

Let  be  the  transition  function  of  F,  i.e., 

$(t+l,s)  *  F(t)$(t,s)  ;  4>(s,s)  *  I  . 

Since  the  state  process  x  defined  by  (2.1a)  satisfies 

t-1 

x(t)  *  *(t,s)x(s)  +  l  $(t,j+l)B(j)w(j)  (2.5a) 

j»S 

for  s  c  [0,t-l]  and,  consequently,  H*(w)  x  H0(x)  *  H“(w)  3  H~(x),  ^ 
easily  follows  that  E{x(t)  |  H^(x)}  *  E(x(t)  |  x(s)}.  Hence,  x  is  a 
Markov  process.  However,  the  difference  equation  (2.1a)  is  not  symme¬ 
tric  with  time:  the  two  terms  in  the  right-hand  member  of  (2.5a)  are 
orthogonal  if  and  only  if  t  >  s. 
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Of  great  importance  in  this  chapter  is  a  backward  counterpart 

of  (2.1a).  By  the  above  argument,  such  an  equation  cannot  be  readily 

obtained  by  merely  reversing  the  time  direction.  In  this  section,  we 

shall  construct  the  backward  version  of  (2.1a)  under  the  assumption 

that  N  >  0;  later  in  Section  2.5,  we  shall  remove  this  assumption. 

Since  the  covariance  matrix  function  P  is  given  by 

t-1 

P(t)  =  $(t,0)N$(t,0) 1  +  l  4>(t,j+l)B(j)B(j)  ^(t.j+l)’  ,  (2.5b) 

3“0 

it  follows  that  P(t)  >  0  for  all  t  e  [0,T]  if  and  only  if  N  >  0. 

In  this  case,  the  process 

x(t-l)  =  P(t)‘2x(t)  (2.6) 

is  well-defined  for  all  t  e  [-1,T],  with  components  in  H.  Let  P  denote 
its  covariance  function 

P(t-l)  :=  E{x(t-l)x(t-l) *}  .  (2.7) 

Then,  the  backward  version  of  (2.1a)  is  given  by 

Lemma  2.1.  Let  x  be  the  state  process  of  the  linear  stochastic  system 
S  ,  and  let  N  >  0.  Then,  the  process  {X(t);  t  e  [-1,TJ)  defined  by 
(2.6)  satisfies  the  backward  recursion 

x(t-l)  «  FCt)'x(t)  +  6(t)?(t)  ;  x(T)  *  l  C2.8) 

for  t  e  [0,T],  where  \  «  PCT+lJ^xCT+l), 

B(t)  *  -P(t)"1P(t)"1B(t)[I  -  B(t) *P(t+l)rlB(t)]^  (2. 9.) 

and  w  is  a  p- dimensional  normal  iz,ed  white  noise  sequence  satisfying 
(2,2)  and  the  condition  H~ (w)  i  H*(x)  for  t  e  [0,T]  and  is  given  by 
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»(t)  =  [I  -  B(t)  'P(t+l)“1B(t)]Js[w(t)  -  B(t)  'F‘  (t)"1P(t)"1x(t)]  .  (2.10) 
The  covariance  function  (2.7)  is  given  by 

P(t-l)  =  P(t)-1 

and  satisfies  the  Liapunov  equation 

P(t-l)  *  F»(t)P(t)F(t)  +  B(t)B(t) *  ;  P(T)  *  PCT+l)'1  (2.11) 

If  N  0,  equations  (2. 8) -(2.11)  are  defined  for  all  t  e  [0, T]  for  which 
P(t)  >  0. 

This  lemma  is  a  generalization  of  the  results  of  Section  1.6 
which  were  obtained  in.  [9]  for  the  stationary  case.  This  is  a  strict 
sense  version  of  the  wide  sense  results  presented  in  [48,50]  which  are 
insufficient  for  our  purposes  since  they  are  deterministic  rather  than 
probabilistic  in  nature.  Moreover,  we  have  chosen  to  write  the  back¬ 
ward  version  (2.8)  in  terms  of  x  rather  than  x,  since  this  choice  will 
yield  a  backward  Kalman-Bucy  filter  which  is  invariant  over  the  class  5. 

The  proof  of  this  lemma  is  based  on  the  observation  that,  as  the 
orthogonal  decomposition 

x(t+l)  «  E(x(t+1)  |  H~(x))  +  [x(t+l)  -  §(x(t+l)  |  H~(x)}]  (2.12) 

yields  (2.1a),  the  orthogonal  decomposition 

x(t)  *  E{x(t)  |  H*+1(x)}  +  [x(t)  -  E{x(t)  j  H*+1(x)}]  (2.13) 


yields  the  backward  version  (2.8).  (The  basic  idea  of  this  proof  first 
appeared  in  [2].)  We  shall  need  the  following  lemma,  the  proof  of  which 
can  be  found  in  most  standard  books  on  estimation  theory. 


64 

Lemma  2.2.  Let  u  and  v  be  two  stochastic  vectors  with  components  in 
H,  and  assume  E{w'}  >  0.  Thent 

E{u  |  v}  *  E{uv,}(E{w,})"1v  . 

Proof  of  Lemma  2.1.  We  shall  prove  the  lemma  for  the  case  N  >  0;  if 
N  }  0,  everything  will  be  the  same  for  t  e  [0,T]  such  that  P(t)"1 
exists.  Since  x  is  Markov, 

§{x(t)  j  H^+1(x)}  =  E{x(t)  |  x(t+l)}  . 

Upon  using  Lemma  2.2,  the  right-hand  side  is  P(t)F(t) ,P(t+l)~1x(t+l) . 
Inserting  this  into  (2.13)  and  multiplying  by  P(t)-1  yields 

x(t-l)  =  F(t)  'x(t)  +  Pft^^nCt)  (2.14a) 

i 

where  n(t)  :=  x(t)  -  E{x(t)  |  x(t+l)}.  But,  x(t)  * 

F(t)-1x(t+l)  -  F(t)_1B(t)w(t),  and  consequently 

n(t)  =  -F(t)”^B(t) [w(t)  -  E{w(t)  |  x(t+l) } 

=  -F(t)”1B(t)  [w(t)  -  B(t) ,P(t+l)"1x(t+l)]  (2.14b) 

where  we  have  used  Lemma  2.2  to  obtain  the  last  relation.  Now,  from 
(2.3),  it  is  not  hard  to  see  that 

B(t)»P(t+l)"1F(t)  *  [I  -  B(t) ,P(t+l)'1B(t)]B(t) 'F(t) ’”1P(t)  . 

Then  inserting  this  and  (2.1a)  into  (2.14b),  we  obtain 

U(t)  * -F(t) -1B(t) [I  -  B(t) •P(t+l)"1B(t)] [w(t)  -  B(t) 'F(t)rlP(t)x(t) ]  . 

This  together  with  (2.14a)  yields  (2.8)  with  6  and  w  given  by  (2.9)  and 
(2.10).  From  the  above  discussion,  it  follows  that 
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[I  -  B(t)  ,P(t+l)"1B(t)]Js  w  (t)  «w(t)  -  E{w(t)  |Hj+1(x)} 

(c£.  [9]),  which  relation  implies  that  w  is  a  white  noise  such  that 
H^(w)  1  H^+1(x)  =  H*(x);  the  factor  in  front  of  w(t)  is  the  appro¬ 
priate  normalization  factor  so  that  w  satisfies  (2.2),  as  can  be  easily 
checked.  Finally,  P(t-l)  =  E{X(t-l)x(t-l) •}  *  P(t)”1E{x(t)x(t) ‘JPCt)'1  * 
P(t)"*.  Equation  (2.11)  is  obtained  from  (2.8)  precisely  as  (2.3)  is 
obtained  from  (2.1a).  0 

2.3.  Forward  and  Backward  Realizations 

Let  the  output  process  y  be  defined  as  in  Section  2.1.  Any 
system  of  type  (2.1)  (with  e  H,  for  i  *  1,  2,  . . . ,  n,  w  satisfying 
(2.2)  and  ^  i  H(w)  for  all  i)  having  y  as  its  output  is  called  a 
realization  of  y.  Clearly,  the  components  of  x,  y  and  w  belong  to  H. 

The  purpose  of  this  section  is  to  introduce  the  two  models  S*  and 
5*,  the  knowledge  of  which  determines  the  frame  space  i(to  be  defined 
in  Section  2.4),  which  in  turn  contains  the  smoothing  estimate. 

As  we  have  seen  in  Section  1.1,  the  linear  least-squares  estimate 

x*(t)  *  E(x(t)  1  (y)}  (2.15) 

of  the  state  process  x  of  S  is  generated  on  [0,T]  by  the  Kalman-Bucy 
filter 

x*(t+l)  -F(t)x*(t)  +B*(t)R*(t)")5[y(t)  -H(t)x*(t)]  ;  x*(0)  *  0,  (2.16a) 
where  the  Kalman  gain  function  B*  is  given  by 


B*  -  [FQ*H*  +  BD’IR;;15  , 


(2.16b) 
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R*  *  HQ*H«  +  DD*  (2.16c) 

and  the  error  covariance  matrix 

Q*(t)  =  E{[x(t)  -  x*(t)][x(t)  -  x*(t)]'}  (2.16d) 

is  the  solution  of 

(^(t+D^FCtDQJtJFCtD'-BJtJBJO'+BWBCt)*  ;  Q*(0)=N  (2.16e) 

which  is  a  matrix  Riccati  difference  equation  when  (2.16b)  is  inserted. 

As  we  shall  see  shortly,  there  are  other  realizations  of  y  which 
have  (2.16a)  as  their  Kalman-Bucy  filter.  Hence  define  $  to  be  the 
class  of  all  realizations  S  of  y  such  that  R(t)  :«* 

D(t)D(t)'  >  0  Y  t  c  [0,T]  and  such  that  the  corresponding  Kalman-Bucy 

filter  is  given  by  (2.16a),  that  is,  it  has  the  same  matrix  functions 
-h 

F,  H  and  K*  :«  B*R*  as  those  of  (2.16a)  and  consequently  the  same  esti¬ 
mates  (x*(t);  t  e  [0,T+1] } . 

The  sequence  (w*(t);  t  e  [0,T]}  defined  by 

w*(t)  =  R*(tr%[y(t)  -  H(t)x*(t)]  (2.17) 

is  called  the  innovation  process.  It  is  a  normalized  white  noise  sat¬ 
isfying  (2.2)  and  characterized  by  the  property  H~(w*)  «  H^(y)  for  all 
t  e  [0,T].  Combining  (2.16a)  and  (2.17),  we  obtain  the  model 

x*(t+l)  «  F(t)x*(t)  +  B*(t)w*(t)  ;  x*(0)  *  0 

(S*)  k  (2.18a) 

y(t)  *  H(t)x*(t)  +  R*(t) \*(t)  , 

k 

which  clearly  belongs  to  S.  It  can  be  immediately  seen  that  the  co- 
variance  matrix  P*(t)  :*  E(x*(t)x*(t) '}  of  x*(t)  satisfies 


P*(t+1)  «  F(t)P*(t)F(t)'  +  B*(t)B*(t)'  ;  P*(0)  *  0  , 


(2.18b) 
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and  that 

Q*  ~  p  ~  P*  • 


(2.18c) 


It  is  essential  at  this  point  to  show  that  S*  is  uniquely  defined 
regardless  of  the  choice  of  the  S  e  S  from  which  S*  was  formed,  i.e., 
that  the  matrices  B*  and  R*  are  both  invariants  for  the  class  3  (by 
definition,  F  and  H  clearly  are).  To  this  end,  we  need  to  define  the 
n  *  m-matrix  function 

G  *  FPH*  +  BD*  (2.19a) 

for  each  realization  S  e  S;  P  is  its  state  covariance  function. 


Lemma  2.3.  Let  G,  R*  and  B*  be  defined  by  (2.19a),  (2.16c)  and  (2.16b) 
respectively.  Then  G,  R*  and  B*  are  invariants  for  the  class  S. 


Proof.  Let  S  e  S  be  arbitrary  and  let  G  be  as  in  (2.19a).  Then 
P  «  Q*  +  P*.  Also,  by  (2.16b),  BD’  *  K*  -  FQ*H*.  Inserting  these  two 
relations  into  (2.19a),  we  obtain 

G  *  FP*H*  +  BJ^  >  (2.19b) 

which,  by  the  definition  of  S,  is  invariant  over  S.  Next,  since 
AQ(t)  :*  E{y(t)y(t) •}  «  H(t)P(t)H(t) 1  +  D(t)D(t)',  R*  «  Aq  -  HP*H*, 
which  does  not  depend  on  the  choice  of  S,  for  Aq  and  P*  do  not.  Finally, 
since  B*  *  K*R*  (or  B*  *  (G  -  FP^H^R***),  B#  is  also  invariant  over  S.  □ 


Consequently,  F,  H,  G,  B„  and  R#  are  invariants  for  S,  whereas 
B,  D,  P,  w  and  x  will  vary  with  different  realizations  S  e  S.  Actually, 
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even  the  dimension  p  of  w  will  vary.  However,  since  R  is  of  full  rank, 
we  will  always  have  p  £  m. 


The  model  S*  belongs  to  a  class  of  realizations  for  which  p  is 

minimal,  i.e.,  p  =  m.  Define  S  to  be  the  subclass  of  all  S  e  S  such 
'  *  o 

that  p  *  m  and  x(0)  e  H(y).  Let 


(S  ) 
v  oJ 

be  a  realization  in 


xo(t+n  -  Fx0(t)  ♦  B0w0(t) 
y(t)  *  HxQ(t)  +  DQw0(t) 

with  state  covariance  P 
o 


> 


o' 


x0(°) 


C2.20) 


As  Dq  is  invertible. 


xQ(t+l)  =  Fx0(t)  ♦  B0D^[y(t)  -  Hx0(t)]  ;  xQ(0)  «  ZQ  .  (2.21a) 

Let  (2.1)  be  an  arbitrary  realization  in  S  and  define 


%  *  P  -  P0  •  (2.21b) 

Then,  by  (2.20)  and  Lemma  2.3, 


R  «  DO  »  *  A  -  HP  H»  «  HO  H»  +  DD' 

o  0  O  O  0  X) 


(2.21c) 


and 


Bq  *  (G  -  FPoH')R‘Js  «=  (FQgH'  +  BD')R^  .  (2.21d) 

Inserting  (2.21d)  into  the  equation  (2.3)  for  PQ  and  subtracting  from 
(2.3),  we  conclude  that  satisfies 


Q0(t+l)«F(t)Q0(t)F(t)’ -B0(t)B0(t)»+B(t)B(t)»  (2.21e) 

Equations  (2.21)  look  formally  like  the  filtering  equations  (2.16),  only 
the  initial  conditions  are  different.  In  view  of  the  assumption  that 
£0  c  H(y),  (2.21a)  implies  H(xQ)  =  H(y) .  We  shall  call  a  realization 
S  c  S  internal  if  it  satisfies  the  condition  H(x)  c  H(y)  and  external 
otherwise  [2]. 
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Therefore,  by  the  above  discussion,  we  have  shown  that  all  S  e.  SQ 
are  internal. 

Our  next  task  is  to  derive  a  backward  realization  3  for  each 
S  c  S.  We  shall  begin  by  restricting  our  attention  to  the  subclass 
5+  consisting  of  all  S  e  S  for  which  N  >  0.  The  class  S+  is  nonempty. 
This  can  be  seen  by  using  an  argument  similar  to  that  in  the  continu¬ 
ous-time  case  [64].  The  basic  idea  is  that  the  stochastic  process  y 
can  be  extended  to  an  interval  [?,T]  where  t  <  0  so  that  the  covariance 
matrix  P(t)  of  a  realization  S  of  y  on  [f,T]  is  positive  definite  for 
t  c  [0,T] .  Hence,  the  restriction  of  S  to  [0,T]  belongs  to  S+.  (See 
[64]  and  [73]  for  details.) 

Let  S  e  S+.  Then,  by  Lemma  2.1,  x(t-l)  «  P(t)"1x(t)  is  defined 
for  every  t  «  [0,T]  and  satisfies  (2.8).  It  remains  to  obtain  a  "back¬ 
ward"  equation  for  y. 

Lanna  2.4.  Let  y  be  given  by  (2.1b).  Then,  y  can  be  written 

y(t)  •  G' (t)x(t)  ♦  0(t)w(t)  (2.22) 

where  G  ie  given  by  (2.19)  card  the  m  *  g-mztrix  function  5  is  given  by 
C(t)  -  [D(t)  -  H(t)F(t) ”^B(t)]  [I  -  B(t)  ,P(t+l)”1B(t)]^  (2.23) 

Proof.  Inserting  (2.19a)  into  (2.1b),  we  get 

y(t)  -  [G * (t)F’ (t)’^P(t)*1  -  D(t)B(t) 'F(t) ,”^P(t)”^]x(t)  +  D(t)w(t)  . 
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From  (2.8), 

x(t)  -  P(t)F(t)»P(t+l)_1x(t+l)  +  P(t)B(t)w(t)  . 

Using  this  expression  for  x(t)  in  the  above  equation  for  y,  we  obtain 
after  some  lengthy  algebraic  manipulations 

y(t)  *  G'  (t)P(t+l)"1x(t+l)  -  G’  (t)P(t+l)-1B(t)  [w(t)  -  B(t)F'  (tJ^PCtJ^xCt)] 
+  D(t)w(t)  -  D(t)B(t)  ’F'  (t)  ”1P(t)~1x(t) . 

=  G* (t)P(t+l)”1x(t+l) 

♦  [D(t)  -  G’ (t)P(t+l)”1B(t)] [w(t)  -  B’ (t)F* (t)~1P(t)”1x(t)]  . 

Using  (2.19a)  and  adding  and  subtracting  the  quantity  H(t)F(t)-1B(t) , 
we  get 

D(t)  -G(t)’P(t+l)'*1B(t)  * 

«  D(t)  -D(t)B(t)'P(t+l)"1B(t)  - H(t)F(t)-1B(t)  +  H(t)F(t) _1B(t) 

- H(t)P(t)F(t)  ,P(t+l)’*1B(t)  . 

«  D(t)  -  D(t)B(t) ,P(t+l)”1B(t)  -  H(t)F(t)_1B(t) 

+  H(t)”*F(t)”^  [P(t+1)  -  F(t)P(t)F(t)  ,]P(t+l)~1B(t)  . 

-  [D(t)  -HWPwSwHi.BW'Pd+D'Vt)] 

where  in  the  last  step,  we  employed  (2.3).  Then,  using  the  definition 
(2.10)  .for  w,  the  desired  result  follows.  0 

Combining  (2.8)  and  (2.22),  we  obtain  the  following  backward 

model 

x(t-l)  «  F(t)»x(t)  +  6(t)«(t)  X(T)  «£ 

(3)  (2.24) 

y(t)  -  G(t)'x(t)  +  C(t)w(t) 

* 
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where  B,  and  D  are  given  by  (2.9),  (2.10)  and  (2.23)  respectively 
and  £  *  P(T+l)""1x(T+l)  l  H(w).  The  state  covariance  function  P(t)  * 
P(t+1)“*  satisfies  (2.11).  We  shall  call  any  model  of  type  (2.24) 
with  y  as  its  output,  €  H,  for  i  =  1,  2,  ...»  n,  w  satisfying  (2.2) 
and  J.  H(w)  for  all  i,  a  backward  realization  of  y.  Note  that  S  and 

§  have  the  same  state  spaces,  i.e., 

Ht(x)  =  Ht^(x)  .  (2.25) 

for  each  t  c  [0,T+1]. 

It  is  essential  at  this  point  to  show  that  the  matrix  fi(t)  := 
D(t)D(t)’  is  positive  definite  wherever  the  matrix  R(t)  :»  D(t)D(t)* 
is.  To  this  end,  define  the  m  *  m-matrix  functions 

A  «  DD’  -  DB’F'~*H'  (2.26) 

A  *  DD'  -  DB*F”1G  .  (2.27) 

Lemma  2.5.  Let  A  and  A  be  defined  by  (2.26)  and  (2.27)  respectively. 
Then  A  *  A’ .  Furthermore ,  A  and  A  are  both  invariants  for  the  class  S. 

Proof.  Inserting  DD’  *  Aq  -  HPH*  (by  (2.16))  and  DB'  *  G'  -  HPF* 

(by  (2.19a))  into  (2.26),  we  obtain 

A  «  Aq  -  G'F* ”1H'  ,  (2.28) 

which  is  invariant  for  S.  Using  (2.19a),  (2.9)  and  (2.23),  it  can  be 
seen  that  the  matrix  function  H  can  be  written 


H»  «  F’PG  +  B5’  . 


(2.29) 
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Now,  by  the  above  argument  and  (2.24),  it  easily  follows  that 

A  =  Aq  -  HF_1G  (2.30) 

from  which  the  lemma  follows.  □ 

Proposition  2.6.  Let  A  and  A  be  defined  by  (2.26)  and  (2.27)  respec¬ 
tively.  Th en,  for  each  t  e  [0,1],  the  following  statements  are  equiva¬ 
lent:  (i)  D(t)D(t)'  >  0,  (ii)  A(t)  is  nonsingularj  (Hi)  A(t)  is 
nonsingulaTy  and  (iv)  5(t)6(t)’  >  0. 

Proof.  The  equivalence  between  (i)  and  (ii)  is  proved  by  Pavon 
([9;Theorem  3.2],  [62]);  this  proof  does  not  require  stationarity.  The 
same  argument  can  be  used  to  prove  the  equivalence  between  (iii)  and 
(iv).  Finally,  the  equivalence  between  (ii)  and  (iii)  follows  trivi¬ 
ally  from  Lemma  2.5.  □ 

In  analogy  with  the  forward  setting,  the  least-squares  estimate 
x*(t)  =  E(X(t)  i  H*+1(y)>  (2.31) 

is  generated  by  the  backward  Xalman-Bucy  filter: 

x*(t-l)  «F(t)'x*(t)  +Mt)Mt)‘}s[y(t)  -G(t)’X*(t)]  ;  X*(T)  -0,(2.32a), 
where 

6*  -  [F’Q*G  +  §D,]R^  ,  (2.32b) 

ft*  «  G'Q*G  +  5B»  ,  (2.32c) 

(which,  by  Proposition  2.6,  is  positive  definite  for  all  t  e  [0,T])  and 
the  error  covariance  matrix  function 
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Q*(t)  *  E([x(t)  -  x*(t)][x(t)  -  x*(t)]'}  (2.32d) 

being  the  solution  of  the  dual  matrix  Riccati  equation 

Q*(t-1)  *  F(t) »Q*(t)F(t)  -  B*(t)fi*(t) '  + B(t)B(t) •  ;  Q*(T)  *  PfT+l)"1,  (2.32e) 

with  B*  given  by  (2.32b). 

The  backward  innovation  process  w*  defined  by 

w*(t)  -  R*(t)"^[y(t)  -  G’(t)xjt))  (2.33) 

is  a  normalized  white  noise  satisfying  (2.2)  and  the  condition 

H*(w*)  *  H*(y).  The  covariance  matrix  P*(t)  :»  E{x*(t)x*(t) '}  satisfies 

the  backward  Liapunov  equation 

P*(t-1)  -F'(t)P.(t)F(t)  +  B*(t)B*(t)'  ;  P*(T)  *  0  (2.34) 

Again,  we  need  to  show  the  invariance  of  the  backward  filter. 

Lemma  2.7.  Let  R*  and  B*  be  given  by  (2.32c)  and  (2.32b)  respectively . 
Then, 

fi*  *  Aq  -  G»P*G  (2.35) 

and 

B*  «  (H»  -  F'P*G)1£*  ,  (2.36) 

i.e.t  R*,  8*,  and  hence  the  model  5*,  defined  by  (2.37)  below ,  are  all 
invariants  for  the  class  S. 

Proof.  It  is  easy  to  see  that  (2.35)  holds.  As  for  6*,  observe  that 
it  can  be  written 

g* «  (f’Pg  +  Be*  -  f^or;1*  , 
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from  which  (2.36)  follows,  upon  using  (2.29).  Hence  §*  does  not  depend 
on  5  (consequently  on  the  choice  of  S  e  $+.)  □ 


Now,  in  the  same  way  as  above,  define  S  to  be  the  class  of  all 
backward  realizations  5  having  (2.32)  as  their  backward  Kalman-Bucy 
filter,  and  let  S+  be  the  subclass  consisting  of  those  5  e  5  for  which 
N  :*  E{£5'}  >  0.  In  the  same  way  as  in  the  forward  setting,  it  is  seen 
that  the  realization 
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x*(t-D  *  F'(t)x*(t)  +  B*(t)w*(t)  ;  x*(T)  -  0 
y(t)  =  G*  (t)x*  (t)  +  Mt^wjt) 


(2.37) 


belongs  to  S.  By  Lemma  2.7,  the  class  S  is  uniquely  defined  in  terms  of 
the  invariants  F,  H,  G  and  Aq,  and  therefore,  the  backward  counterpart 
5  of  any  S  e  S+  belongs  to  S.  In  particular,  since  P(T+1)  is  positive 
definite  and  since  P(T)  =  P(T+1)_1,  S  e  S+.  Also,  note  that,  by  Pro¬ 
position  2.6,  56'  >  0  for  all  5  e  S+. 

It  is  clear  that  there  is  a  complete  symmetry  between  forward 
and  backward  realizations.  In  particular,  the  subclasses  S+  and  S+ 
are  in  one-one  correspondence.  Therefore,  in  the  following  lemma,  we 
summarize  the  procedure  for  constructing  a  forward  realization  corres¬ 
ponding  to  a  backward  one  in  S+.  This  lemma  is  the  counterpart  of 
Lemma  2.1. 


Lemma  2.8.  Let  (2.24)  be  an  arbitrary  baohaxrd  realization  in  .S+  with 
state  prooee8  x  and  state  covariance  function  P.  Then t  the  process  x, 
defined  by 
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x(t)  *  P(t-l)_1X(t-l)  (2.38) 

for  t  £  [0,T],  satisfies  the  forward  recursion 

x(t+l)  *  F(t)x(t)  +  B(t)w(t)  ;  x(0)  *  PC-D^Xf-l) ,  (2.39a) 

where 

B(t)  «  -POO^FftV^BCtHl  -  (2.40) 

and  w  is  a  p- dimensional  normalized  white  noise  sequence  satisfying 
(2.2)  and  the  condition  H*(w)  l  H^(x)  for  all  t  c  [0,T]  and  is  given  by 

w(t)  *  [I-B^’P^-n^BCt)]^!)  -§(t)F(t)'-1P(t)"1x(t)].  (2.41) 

Moreover ,  the  process  y  satisfies  the  recursion 

y(t)  *  H(t)x(t)  +  D(t)w(t)  ,  (2.39b) 

where  the  matrix  function  D  is  given  by 

D(t)  *  [D(t)  -  G(t) 'F(t) ,-1B(t)] [I  -  6(t) ,P(t-l)”1B(t)]  .  (2.42) 

Finally t  if  5  i  S+,  relations  (2. 38) -(2.43)  hold  for  all  t  e  [0,1],  for 
which  P(t-l)”*  exists. 

In  the  sequel,  we  shall  be  interested  in  obtaining  the  backward 
(forward)  counterpart  of  S*  (5*).  As  is  clear  by  now,  for  this  we  need 
to  invert  the  matrices  P*(t)  in  (2.18b)  and  P*(t-1)  in  (2.34).  However, 
since  P*(0)  *  0  and  P*(T)  *  0,  P*1  and  P*1  will  not  exist  on  the  whole 
interval  [0,T];  they  may  not  even  exist  on  part  of  it  for  that  matter. 
Therefore,  we  introduce  the  following 

Definition  2.9.  Let  t*  be  the  smallest  t  such  that  P*(t)  >  0;  if  there 
is  no  such  t  on  [0,T] ,  set  t*  :*  T+l.  Similarly,  let  t*  be  the  largest 
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t  such  that  P*(t-1)  >  0;  if  there  is  no  such  t,  set  t*  =  -1. 

As  is  clear  from  the  definition  above,  t*  and  t*  might  lie  out¬ 
side  the  interval  [0,T].  However,  if  we  impose  some  more  conditions 
on  the  class  S,  we  can  guarantee  that  t*  and  t*  belong  to  [0,T], 

Definition  2.10.  The  class  S  is  said  to  be  minimal  if  there  is  no 
realization  of  type  (2.1),  the  state  process  of  which  has  dimension 
smaller  than  n. 

Lemma  2.11.  S  is  minimal  if  and  only  if  S  is  minimal. 

Proof.  Let  S  be  minimal.  Assume  that  there  is  a  backward  realization 
5  e  5  such  that  5  is  not  minimal.  Then  all  5  in  the  (nonempty)  sub¬ 
class  S+  are  also  nonminimal,  and  consequently,  by  Lemma  2.8,  we  could 
construct  a  nonminimal  forward  realization  from  such  an  5,  contradict¬ 
ing  the  minimality  assumption  of  S.  The  converse  follows  analogously.  □ 

Lemma  2.12.  Let  S  be  minimal.  Then  t,  i  T  and  t*  £  0. 

Proof.  Since  S*  c  S,  S*  is  minimal.  Hence,  the  pair  (F,B*)  is  com¬ 
pletely  controllable  on  the  interval  [0,T]  [63].  In  fact,  were  this  not 
the  case,  the  input -output  map  of  S*  could  be  reduced  [4]  contradicting 
minimality.  Therefore,  the  controllability  gramian 

W*(0,t;)«  l  Ht*,j+l)B*(j)B*(j)»$(t*,j+l)' 
j*0 


(2.43) 
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is  positive  definite  for  some  t*  e  [0,T].  Consequently,  upon  writing 
the  solution  P*  of  (2.18b)  as  in  (2.5b),  we  see  that  P*(t)  >  0  for  all 
t  e  [t*,T],  Now,  if  S  is  minimal,  5  is  minimal  also  by  Lemma  2.11. 
Since  §*  €  S,  we  can  analogously  show  that  t*  i  0.  0 

Of  course,  there  is  no  guarantee  that  there  is  a  t  e  [0,T]  for 
which  both  P*(t)  and  P*(t-1)  are  positive  definite. 

Definition  2.13.  The  class  S  is  said  to  be  regular  if  t*  s  t*+l,  i.e. , 
for  each  t  e  [0,T],  either  P*(t)  >  0  or  P*(t-1)  >  0  (or  both),  and  is 
said  to  be  irregular  otherwise. 

In  fact,  the  regularity  property  of  the  class  S  depends  to  a 
certain  extent  on  the  length  of  the  interval  [0,T];  for  if  T  is  suffi¬ 
ciently  large  compared  with  n,  S  will  be  regular,  since  then  the  con¬ 
trollability  gramian  (2.43)  will  eventually  become  positive  definite; 
the  same  holds  in  the  backward  direction  for  the  controllability 
gramian  W*(T,tw)  of  S*.  If  T  <  n,  we  will  encounter  irregularity;  then 
we  do  not  have  minimality  either. 

2.4.  The  Frame  Space 

Now,  we  are  ready  to  justify  introducing  the  two  processes  x*  and 
x*.  It  follows  from  (2.15)  that 

Ht(x*)  =  E{Ht(x)  |  Ht_1(y)}  c  H^y)  . 

Also,  by  (2.25)  and  (2.31),  we  have 
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Ht_l(xJ  *  E{Ht(x)  |  H*(y)}  c  H*(y)  , 
for  all  t  e.  [0,T],  Define  the  orthogonal  complements 

Nt  :=  Ht-lCy)  9  and  Nt  :=  Ht(y)  9  Ht-lCX*3  * 

Then,  H(y)  can  be  decomposed  as 

H(y)  *  N“  •  Hj  *  N*  ,  (2.44) 

where  H“  is  the  frame  space  [53,54,56] 

Hj  =  Ht (xj  VHt_1  (xj  (2.45) 

for  all  t  €  [0,T] . 

Lemma  2.14,  Let  x  be  the  state  process  of  a  realization  S  e  S.  Then 

Ht(x)  c  H®  0  tH(y)  ]x  (2.46) 

for  all  t  e  [0,T], 

Proof.  See  the  proof  of  Lemma  3.7  in  [64],  □ 

Notice  that  therefore,  the  smoothing  estimate  (2.4)  will  always 
be  contained  in  the  frame  space,  hence  its  importance. 

In  the  continuous-time  setting  of  our  papers  [64,65],  the  frame 
space  has  the  constant  dimension  2n  on  the  open  interval  (0,T).  This 
however  will  not  be  the  case  here,  and  this  contributes  to  the  fact  that 
the  discrete-time  results  are  nontrivial  modifications  of  the  continuous¬ 
time  ones.  To  see  this,  first  note  that  the  dimensions  of  H  (x*)  and 

t 

Ht_i(5c*)  can  be  related  to  the  ranks  of  P*  and  P*,  and  hence  to  the  con¬ 
dition  of  regularity  through  the  following 
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Lemma  2.15.  Let  x  be  a  stochastic  vector  with  covariance  P  and  let 
H(x)  he  the  span  in  H  of  its  components.  Then 

dim  H(x)  *  rank  P  .  (2.47) 

be  an  orthonormal  basis 

in  H(x) .  Then  x  *  +  •••  +  f°r  some  & 2 »  •••>  ^r€ 

Define  the  n  *  r-matrix  L  =  (5,^,  , ...  i.y) .  Then  x  *  L£,  and  hence 

P  *  LL' .  Consequently,  rank  P  i  r,  with  equality  if  and  only  if  L  is 

T— 1 

full  rank.  But  L  must  be  full  rank,  because  otherwise  l  *  E,  .  a,  Z. 

*  r  k=l  k  k 

for  some  a.,  a_,  ...,  a  e  ]R  .  Then  x  *  where  £.  * 

12  r  i*l  ii  i 

^i  +  ai  *  i  *  1*  2»  •••>  r“l*  Hence,  H(x)  is  the  span  of  the  r-1  random 

A  /V  /\ 

variables  {f^,  £2,  .....  £^1,  which  contradicts  the  fact  that  dim  H(x)  *  r. 

Consequently,  the  dimensions  of  H^Cx*)  and  Ht_1(x*)  vary  between 
0  and  n.  When  S  is  regular,  the  lemma  implies  that,  for  each  t,  at  least 
one  of  these  spaces  has  dimension  n  and  that  there  are  some  t‘s  for 
which  both  of  them  have  dimension  n.  Consequently,  for  each  t  e  [0,T], 
n  £  dim  s  2n,  where  each  limit  is  attained  for  some  t.  (To  see  this, 
we  note  that  under  the  given  conditions,  it  can  be  shown  that 
Ht(x*)  n  Ht_1(x*)  =  {0}.)  On  the  other  hand,  if  S  is  irregular, 
dim  H®  <  n  on  the  whole  interval. 

It  will  be  more  convenient  in  the  sequel  to  express  the  frame 
space  H®  in  a  somewhat  more  symmetric  form.  To  this  end,  we  shall  con- 
struct  the  forward  counterpart  of  5*.  However,  since  P*(t-1)  is  not 


Proof.  Set  r  :=  dim  H(x)  and  let  £  =  ^2 

fr 
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positive  definite  for  every  t  e  [0,T],  the  previous  argument  of  con¬ 
structing  a  backward  realization  from  a  forward  one  cannot  be  reversed 
on  the  whole  interval.  Therefore,  we  apply  the  generalized  Moore - 
Penrose  pseudo-inverse  (see  e.g.  [13]). 

Definition  2.16.  Let  P  be  any  matrix.  The  generalized  Moore-Penrose 
pseudo-inverse  P#  of  P  is  the  unique  matrix  satisfying 

(i)  PP#P  *  P,  (ii)  P#PP#  =  P#,  (iii)  (PP#)*  =  PP#,  and 

(iv)  (P#P)'  *  P#P  .  (2.48) 

(Hence,  if  P  is  nonsingular,  Pff  *  P  .)  In  the  sequel,  we  shall  need 
the  following 

Lemma  2.17.  Let  x  be  a  stochastic  vector  with  covariance  matrix  P. 

Then 

PP#x  *  x  .  (2.49) 


Proof.  If  x  =  0,  the  statement  is  trivial;  hence  assume  that  x  *  0. 
By  the  Singular  Value  Decomposition  Theorem  [66],  there  exists  an  or¬ 


thogonal  matrix  V  (i.e.,  W1 


Pj^  >  0.  Then  x  can  be  written  V 


I)  such  that  P  =  V 

*i 


?i  o 
0  0 


V*  where 


where  x^  is  stochastic  vector 


with  covariance  P^.  Also,  it  is  easy  to  see  that  P#  * 
Relation  (2.49)  then  follows  by  direct  multiplication. 


v  piX  0  V’. 
0  0 


□ 


Now,  define 


x*(t)  -  P*(t-l)#**(t-l) 


(2.50) 


for  each  t  e  [0,T] .  Then,  premultiplying  both  sides  of  (2.50)  by 
P*(t-1)  and  applying  Lemma  2.17,  we  obtain 

x*(t-l)  *  P*(t-l)x*(t)  .  (2.51) 

Hence  Ht(x*)  *  ^  (x*)  and  consequently,  we  have  the  following  symme¬ 

tric  expression 

*  Ht(xJVHt(x*)  (2.52) 

for  the  frame  space. 

The  next  section  will  be  devoted  to  finding  a  forward  recursion 
for  x*. 

Of  course,  we  can  equally  well  have  a  symmetric  expression  for 
H®  involving  two  processes  that  are  the  states  of  backward  realizations 
via 

Ht  *  VHt.iCX*)  (2.53) 

where  x*  is  defined  by 

**(t-l)  *»  P*(t)#x*(t)  (2.54) 

for  all  t  c  [0,T]. 

2.5.  The  Model  S* 

The  aim  of  this  section  is  to  construct  and  study  the  properties 
of  the  model  S*  whose  state  process  is  x*  defined  by  (2.50).  As  we 
mentioned  earlier,  since  5*  4  S+,  the  forward-backward  construction  of 
Lemmas  2.1  and  2.8  cannot  be  applied  here.  However,  the  basic  idea  of 
the  derivation  is  the  same,  but  the  results  will  be  somewhat  different. 
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Although  for  our  purposes  we  only  need  to  construct  the  forward  model 
(S*)  corresponding  to  (§*),  nevertheless,  and  for  completeness  only, 
we  shall  do  the  reverse  procedure,  i.e.,  also  construct  a  backward  model 
corresponding  to  any  S  e  S  (note  not  in  S+) . 

To  do  this,  we  need  the  following  three  lemmas.  The  first  of 
these  is  the  natural  generalization  of  Lemma  2.2  and  can  be  found  in 
most  standard  texts.  The  other  two  are  generalizations  of  results  in 
[9,67]. 

Lemma  2.18.  Let  u  and  v  be  two  stochastic  vectors  with  components  in  H. 
Then 

i{u  |  v)  -  E{uv‘}(E{w«})*v  .  (2.55) 

Lemma  2.19.  (a)  Let  P  be  the  state  covariance  function  of  any  S  e  S. 

Then 

P(t)F(t) '  «  P(t)F(t)'P(t+l)#P(t+l)  (2.56) 

for  all  t  e  [0,T], 

(b)  Let  P  be  the  state  covariance  function  of  any  5  e  S. 

Then 

P(t)F(t)  «  P(t)F(t)P(t-l)#P(t-l)  (2.57) 

for  all  t  e  [0,T]. 

Proof,  (a)  Postmultiply  both  sides  of  (2.12)  by  x(t+l)  and  taking  co- 
variances,  Lemma  2.18  and  the  fact  that  the  components  of  x(t+l)  are 
orthogonal  to  those  of  the  second  term  of  the  right-hand  side  of  (2.12) 
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yield  (2.56). 

(b)  The  proof  is  analogous  to  that  of  (a) .  D 

Lemma  2.20.  Let  P  be  as  in  the  previous  lemma.  Then 

P(t+l)P(t+l)#B(t)  «  B(t)  (2.58) 

for  all  t  c  [0,T], 

Proof.  To  prove  this  lemma,  we  follow  [67).  First,  premultiply  (2.1a) 
by  P(t+l)P(t+l)*  to  obtain 

P(t+l)P(t+l)#x(t+l)  «P(t+l)P(t+l)#F(t)x(t)  +P(t+l)P(t+l)#B(t)w(t),  (2.59) 

and  observe  that  the  left-hand  side  of  (2.59)  is  x(t+l)  (by  Lemma  2.17). 
Next,  reformulate  (2.56)  to  read 

P(t+l)P(t+l)#F(t)P(t)  «  F(t)P(t)  . 

Postmultiplying  this  by  P(t)#x(t)  and  using  Lemma  2.17  again,  we  see  that 
the  first  term- of  the  right-hand  side  of  (2.59)  is  F(t)x(t).  Comparing 
(2.59)  with  (2.1a),  one  obtains 

B(t)w(t)  «  P(t+l)P(t+l)#B(t)w(t)  , 

which  postmult iplied  by  w(t)*  and  taking  expectations  yields  (2.58).  □ 

Now,  we  are  ready  to  state  the  first  main  result  of  this  section, 
which  is  the  analogue  of  Lemma  2.1  and  which  provides  us  with  a  back¬ 
ward  counterpart  of  any  S  e  S  on  the  whole  interval  [0,T]. 


Proposition  2.21.  Let  S  be  an  arbitrary  realization  in  S  and  let  the 


matrix  function  0  be  given  by 

0(t)  «  I  +  B(t)»F(t)»“1P(t)#F(t)"1B(t)  .  (2.60) 

Then  S  has  the  following  backward  counterpart 

x(t)  *  P(t)F(t) *P(t+l)#x(t+l)  +  P(t)B(t)w(t) 

(S)  _  (2.61) 

y(t)  *  6(t)*x(t+l)  +  D(t)w(t) 

» 

on  [o,T]j  where 

B(t)  «  -P(t)#F(t)-1B(t)[I  -  B(t)'P(t+l)#B(t)]0(t)*  (2.62a) 

D(t)  *  [D(t)  -  G(t)’P(t+l)#B(t)]0(t)Js  (2.62b) 

G(t)  «  F(t)P(t)#P(t)F(t)-1G(t)  (2.62c) 

and  w  is  a  white  noise  satisfying  (2.2)  such  that  H”(w)  l  H*(x)  given 

by, 

w(t)  -  0(t)_1*[w(t)  -  B(t)  *F(t)  *_1P(t)#x(t)3  .  (2.63) 


Proof.  Applying  the  orthogonal  decomposition  (2.12)  to  (2.1a),  we 
obtain 

x(t)  «P(t)F(t)'P(t+l)#x(t+l)  +  [x(t)  -P(t)F(t)»P(t+l)#x(t+l)].(2.64) 

Many,  but  straight-forward  algebraic  manipulations,  applying  Lemmas 
2.17,  2.19  and  2.20,  yield  the  first  of  relations  (2.61).  The  argu¬ 
ment  of  Lemma  2.4  can  now  be  used  to  prove  the  second  relation  in 
(2.61).  D 

Remark.  If  we  define  X(t-l)  «  P(t)*x(t),  we  could,  using  (2.61),  obtain 


a  backward  model  whose  state  is  x;  but  the  model  obtained  will  not,  in 
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general,  belong  to  S  since  P(t)  P(t)F(t)'  and  G  are  not  equal  to  F(t) ' 
and  G.  If  P(t)  >  0,  (2.61)  premultiplied  by  P(t)"*  will  be  (2.24). 

Using  an  analogous  argument  in  the  backward  setting,  the  following 
proposition  can  be  easily  proved. 

Proposition  2.22.  Let  §*  and  x*  be  given  by  (2.37)  and  (2.50)  respec- 
tively  and  define  0*  as 

8*(t)  *  I  +  B*(t)»F(t)'1P*(t)#F(t)»‘15^(t)  .  (2.65) 

Then  x*  ie  the  state  process  of  the  following  model t  which  is  the  for¬ 
ward  counterpart  of  5*. 

fx* (t+1)  «n(t+l)F(t)x*(t)  +H(t+l)B*(t)w*(t)  ;  x*(0)  «P*(-l)#x*(-l) 


(S*)  1  k  k 

1  y(t)  -  H(t)x*(t)  +  R*(t)V(t)  ,  (2.66) 

where 

B*(t)  *  -P*(t)#F(t)’'16*(t)[I-5*(t)«Pjt-l)#Sjt)]0*(t)J5,  (2.67a) 

R* (t)*5  *  [&*(»)*  -  H(t)f*(t-l)#B*(t)]0*(t),s  ,  (2.67b) 

-  H(t)  »  H(t)F(t)'1n(t+l)F(t)  ,  (2.67c) 

w *  ie  a  white  noise  satisfying  (2.2)  and  the  forward  property 
H*(w*)  i  H^(x*)  given  by 

W*(t)  -  ©*(t)“*SI«* (t)  -  6*(t)’F(t)“1P#(t)#xJt)  ,  (2.68) 

and  the  n  *  n -matrix  function  II  is  defined  by 

n(t)  :«  P*(t)P*(t)#  ,  (2.69) 


where  ?*  is  the  covariance  function  of  x*3  i.e.,  P*(t)  ■  E{x*(t)x*(t) 
which  satisfies  the  Liapunov  type  equation 


'»i  ii  u4iijiiH.ijuai.ii 
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P*(t+1)  *n(t+l)F(t)P*(t)F(t)'II(t+l)  +B*(t)B*(t)'  ;  P*(0)  =  P*(-1)#.  (2.70) 

Proof.  First,  in  a  manner  analogous  to  that  of  Proposition  2.21,  it 
can  easily  be  seen  that  x*  is  the  state  process  of  the  forward  realiza¬ 
tion 

x*(t)  -  P*(t)F(t)P*(t-l)#x*(t-l)  ♦  P*(t)B*(t)w*(t) 

\  .  k  (2.71) 

y(t)  «  H(t)P*(t-l)#x*(t-l)  +  R*(t)V(t)  ,  (2.71) 

k 

where  B*,  w*,  H  and  R*  are  given  above.  Premultiply  the  first  equation 
in  (2.71)  by  P*(t)#,  observe  that  P*(t)#  *  P*(t+1)  and  use  (2.50), 

(2.48),  and  (2.69)  to  obtain  (2.66).  Finally,  (2.70)  follows  from  the 
state  equation  in  (2.66).  □ 

The  matrix  function  II  defined  by  (2.69)  will  play  an  important 

2 

role  in  what  follows.  Relations  (2.48)  imply  that  II  *  II  and  that 
II  «  II',  hence  II  is  an  orthogonal  projection.  Also,  Lemma  2.17  yields 

JI(t)x*(t)  *  x* (t)  (2.72) 

for  all  t  e  [0,T+1].  Finally,  if  5  is  minimal,  S  is  also  minimal 
(Lemma  2.11),  in  which  case  Lemma  2.12  guarantees  the  existence  of  a 
t*  e  [0,T]  such  that  P*(t)  >  0  for  all  t  c  [0,t*].  Then  II  =  I  on  [0,t*]; 
in  this  case,  x*  will  satisfy  a  recursion  of  type  (2.1a),  and  the  fol¬ 
lowing  relation 

P*(t)  *  P(t)  s  P*(t)  (2.73) 

holds  for  each  such  t.  (In  fact,  P*(t)  S  P(t)  holds  for  all  t  e  [0 ,T] . ) 


Finally,  the  following  three  lemmas  will  be  needed  in  the  sequel. 
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Lenina  2.23.  Let  x  be  the  state  process  and  P  the  state  covariance 
function  of  any  S  e  S+  and  let  x*  and  x*  be  defined  by  (2.15)  and  (2.50) 
respectively.  Then 

E(x(t)x*(t)0  -  P*(t)  (2.74) 

and 


E(x(t)x*(t)*}  -  P(t)H(t)  .  (2.75) 


for  all  t  €  [0,T+1] . 


Proof.  In  view  of  the  definition  (2.15),  Ht(x  -  x*)  l  Ht(x*)  and 
therefore,  (2.74)  follows.  Since  S  e  S+,  5  c  S+,  and  hence  the  backward 
counterpart  of  (2.74)  reads  E{x(t)x*(t) •}  «  P*(t).  But  x*(t)  » 
P*(t)x*(t-1),  and  hence  E{x(t)x*(t) ’}  *  P(t)E{x(t-l)x*(t-l) •}P*(t), 
which  yields  (2.75).  0 

Lemma  2.24.  Assume  that  S  is  regular.  Let  x*  and  x*  be  defined  by 
(2.16)  and  (2.66)  respectively.  Then»  for  all  t  e  [0,T+1], 

E{x* (t)  J  x*(t)}  *  n(t)x*(t)  (2.76) 

and 

E{x*(t)x*(t)»}  -P*(t)n(t)  .  (2.77) 

Proof.  Since  S  is  regular,  the  two  intervals  [0,t*]  and  [t*,T+l]  cover 
the  whole  interval  [0,T+1].  On  [0,t*],  P*(t-1)  >  0,  and  consequently 
S*  has  all  the  properties  of  realizations  in  S+  on  that  interval . 

Since  II  =  I  on  [0,tj,  this  implies  that  (2.76)  and  (2.77)  hold  there. 
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Similarly,  on  [t*,T+l],  P*(t)  >  0,  and  therefore,  analogously  with  the 
above,  E{x*(t-l)x*(t-l) '}  *  P*(t-1)  on  this  interval.  Premultiply  this 
by  P*(t)  and  postmultiply  by  P*(t)  and  remembering  that  P*(t-1)  *  P*(t)*, 
(2.77)  is  seen  to  hold  on  [t*,T+l].  Then  Lemma  2.18  provides  the  re¬ 
quested  formula  (2.76)  on  this  interval.  □ 

Lemma  2.25.  Let  S  e  S+  and  let  Q*  be  the  covariance  function  of 
II(x*  -  x).  Then 

Q*  *  JI(P*  -  P)n  .  (2.78) 

If  in  addition  S  is  regular ,  then 

E{[x  -  xj[x*  -  x]»n}  »  0  .  (2.79) 

Proof.  Relation  (2.78)  follows  from  (2.75).  Relation  (2.79)  follows 
from  (2.74),  (2.75)  and  (2.77).  □ 

Remark.  Since  II  is  projection,  IIQ*II  *  Q*. 

2.6.  A  Mayne-Fraser-Type  Smoothing  Formula 

Given  the  state  process  x  and  the  output  process  y  of  a  model 
(2.1),  the  smoothing  problem  consists  in  determining  the  smoothing 
estimate 

*(t)  -  E{x(t)  |  H(y)}  (2.80) 

of  x  and  the  smoothing  error  covariance 


Z(t)  -  E{[x(t)  -  *(t)][x(t)  -  *(t)]'} 


(2.81) 
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for  all  t  e  [0,T],  In  this  section,  we  shall  derive  a  smoothing  formula 
for  the  case  that  5  is  regular.  At  the  end  of  the  section,  we  shall 
present  a  conjecture  which,  if  true,  would  allow  us  to  remove  this  regu¬ 
larity  assumption.  Our  approach  w’.ll  utilize  an  orthogonal  decomposition 
of  the  frame  space  H°  to  be  given  below. 

In  view  of  Lemma  2.14,  £(t)  e  H°.  In  order  to  obtain  a  formula 
for  x,  we  need  to  decompose  the  frame  space  H°  perpendicularly.  To  this 
end,  we  introduce  the  process  z  defined  by 

z (t)  *  x*(t)  -  II(t)x*(t)  (2.82) 

for  all  t  c  [0,T+1] . 

Lemma  2.26.  Assume  that  the  class  S  is  regular.  Let  z  he  defined  by 
(2.82),  and  lei  Q(t)  :*  E{z(t)z(t) '}.  Then 

*  Ht(x*)  •  Ht(z)  (2.83) 

for  all  t  e  [0,T+1].  Moreover t 

Q  =  1I(P*  -  P*)I!  (2.84) 

and 

Q  *  h(Q*  +  Q*)H  (2.85) 

where  Q*  and  Q*  are  given  by  (2.18c)  and  (2.78)  respectively  and  P  is 
the  state  covariance  of  any  S  e  S. 

Proof.  Since  S  is  regular.  Lemma  2.24  implies  that  the  components  of  z 
are  orthogonal  to  those  of  x*>  and  therefore  (2.83)  follows  from  (2.52). 
Next,  since  x*  «  Jbc* ,  z  *  JI(x*  -  x*).  Then  (2.84)  is  a  direct 
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consequence  of  (2.77).  Finally,  write  Q  =  II(P*  -  P  +  P  -  P*)II  to  obtain 
(2.85).  □ 


Lemma  2.27.  Let  z  be  given  by  (2.82)  and  Q  be  its  covariance  function. 
Then 


and 


Q  ■  IIQ  ■  QU  ■  IIQH  , 

(2.86a) 

q#  =  iiq#  =  Q#n  *  nQ#n 

(2.86b) 

QQ#  =  Q#Q  *  n  . 

(2.86c) 

Proof.  We  shall  first  show  that  Q  *  IIQII  and  Q*  *  IIQ#II,  the  first  of 
which  follows  trivially  from  (2.84)  and  the  fact  that  H  is  a  projection. 
For  the  second  relation,  let  t  6  [0,T]  be  fixed.  The  case  P*(t)  *  0 
is  trivial  (for  then  Q(t)  *  0  and  II(t)  *  0);  hence  we  shall  assume  that 
P*(t)  *  0.  As  mentioned  in  Lemma  2.17,  there  exists  an  orthogonal 
matrix  V  such  that  P*(t)  *  Vj|*^  ^Jv»  where  P*  >  0.  Then  P*(t)#  * 
v[£  ^  •>  and  II(t)  ®  v|q  qj^’*  view  °f  the  fact  that  Q  *  IIQII, 

Q(t)  can  be  written  Q(t)  =  ^jv.  We  want  to  show  that  Q(t)  >  0. 

But  in  view  of  (2.85),  this  must  be  the  case,  because  if  we  choose  S  in 
S+,  Q*(t)  >  0  and  Q*(t)  i0.  Hence  Q(t)#  »  V  Jv  and  conse- 

#  M 

quently,  Q  *  IK?' II.  From  the  above  discussion  it  is  clear  that  (2.86c) 
holds.  The  rest  of  relations  (2.86)  follow  trivially  by  remembering 
that  II  is  a  projection.  □ 


Now,  we  are  ready  to  present  the  main  result  of  this  chapter. 
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Theorem  2.28.  Assume  that  xihe  class  S  is  regular.  Let  x  he  the  state 
process  of  any  S  of  class  S+.  Then  the  smoothing  estimate  (2.80)  is 
given  by 

x(t)  =  [I  -  Q*(t)Q(t)#]x*(t)  ♦  Q*(t)Q#(t)x*(t)  (2.87) 

and  the  error  covariance  (2.81)  by 

2(t)  '  Q*(t)  -  Q*(t)Q(t)#Q# (t)  (2.88) 

for  all  t  €  [0,T+1] . 

Proof.  Since  x(t)  *  £{x(t)  |  H®}  (Lemma  2.14),  (2.83)  yields 

x(t)  *  E{x(t)  |  x*(t)}  +  E(x(t)  |  z(t)}  , 

which  upon  using  (2.15)  and  Lemma  2.18  can  be  written 

x(t)  -  x*(t)  +  E{x(t)z(t)’}Q(t)#z(t)  . 

In  view  of  (2.74)  and  (2.75),  E{x(t)z(t)‘)  *  Q*(t)ll(t).  Since  z  «  Hz, 
(2.87)  follows  from  the  above  relation  noting  (2.86).  To  prove  (2.88), 
observe  that 

x  -  x  =  (I  -  Q*Q*)(x  -  x*)  ♦  Q*Q#(x  -  x*)  . 

Replacing  Q*  by  IIQ#n,  and  noting  that  the  two  terms  above  are,  in  view 
of  (2.79),  orthogonal,  we  obtain 

s  «  Q*  -  Q*nQ*no*  -  q*iiq#iiq* 

+  Q*nQ#nQ*nQ#nc^  +  Q*nQ#nQ*nQ#iiQ*  , 
which,  in  view  of  (2.85)  and  (2.86),  yields  (2.88).  □ 
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To  obtain  the  discrete-tine  version  of  the  Mayne-Fraser  formula 
[38,39],  the  following  lama  is  needed. 

Lemma  2.29.  Let  S  c  S+.  Then t  for  each  t  e  [0,T+1],  Q*(t)  and  E(t) 
are  positive  definite  and  satisfy 

ICt)’1  *  Q* (t) _1  ♦  Q*(t)#  (2.89) 

where  Q*  is  given  by  (2.78). 

Proof.  Let  P  be  the  covariance  function  of  S.  Since  S  €  S+,  Q*(0)  « 
P(0)  -  P*(0)  ■  N  >  0,  consequently  Q*(t)  >0  for  all  t  c  [0,T+1].  To 
see  this,  observe  that  the  Riccati  equation  (2.16e)  can  be  reformulated 
to  read 

Q*(t+i)  -  r,(t)Q*(t)r,(t)' 

♦  (B*(t)R*(t)"lsD(t)  -B(t))(B*(t)R,(t)JsD(t)  -B(t))’-,  (2.90) 

where  T*  is  the  feedback  matrix 

T*  -  F  -  B*R^H  .  (2.91) 

The  Liapunov -type  equation  (2.90)  can  be  written  in  the  form  (2.5b)  to 
yield  Q*(t)  >0  for  all  t  e  [0,T+1],  since  Q*(0)  >  0.  The  same  argu- 
emtn  can  be  used  to  prove  that  Z  >  0:  first  determine  l  from  the 
Liapunov-type  equation  corresponding  to  the  backward  representation 
(2.115)  below  (the  proof  of  which,  of  course,  does  not  depend  on  this 
lemma);  then,  as  above,  note  that  for  all  t  «  [0,T],  I(t)  *  E(T+1)  * 
Q*(T+1)  >  0.  This  proves  the  positivity  of  I.  Next,  it  follows  from 
(2.88)  that 


I  »W<-Wi 
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I  -  Q*Q#  8  2Q*1  (2.92) 

which  is  nonsingular  for  all  t  £  [0,T+1].  Then  applying  the  matrix 
inversion  lemma  (1.34)  to  (2.88),  it  is  seen  that  E~*  *  Q~*  +  Z,  where 

Z  -  Q#(I  -  Q*Q#)“1  «  (I  -  Q#Q*)_1Q#  .  (2.93) 

Therefore,  it  just  remains  to  show  that  Z  *  Q*#.  Since  II  is  a  projec¬ 
tion,  (2.85)  yields 

Q*  =  Q  -  nQ*n  .  (2.94) 

In  view  of  Lemma  2.27,  (2.94)  and  the  first  of  relations  (2.93)  yield 
Q*Z  *  II,  which  is  symmetric.  Likewise,  using  the  second  of  relations 
(2.93),  we  obtain  ZQ*  *  II.  Then,  Lemma  2.27  implies  that  Q*ZQ*  «  Q* 
and  ZQ*Z  *  Z  also.  Consequently,  by  Definition  2.16,  Z  *  Q*#.  □ 

Now,  we  are  in  a  position  to  state  the  Mayne-Fraser  two-filter 
formula. 

Theorem  2.30.  Let  S  be  regular  and  let  S  €  S+.  Then  the  smoothing 
estimate  x  of  the  state  x  of  S  is  given  by 

x(t)  »  Z(t)[Q,(t)_1x*(t)  +  Q*(t)#x*(t)]  (2.95) 

where  x*,  x*  and  Z  are  given  by  (2.16),  (2.66)  and  (2.89)  respectively. 

Proof.  It  follows  from  (2.89)  and  (2.92)  that  Q*Q*  «  I  -  EQ^1  * 

*»  EfE”1  -  Q^1)  «  ZQ*#.  This  together  with  (2.92)  yields  (2.95)  when 

□ 


inserted  in  (2.88). 
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Although  the  regularity  assumption  imposed  on  the  class  $  is  not 
a  very  stringent  one  (especially  if  T  is  large  enough  in  comparison  with 


n),  it  would  be  interesting  to  see  if  it  can  be  removed.  Actually, 
this  assumption  was  introduced  only  to  obtain  relation  (2.76).  We  be¬ 
lieve  that  (2.76)  holds  without  the  regularity  assumption;  as  expressed 
in  the  following 

Conjecture.  Let  x*  and  x*  be  defined  by  (2.15)  and  (2.50)  respectively. 
Then 

E{x*(t)  |  x*(t)}  *  n(t)x*(t)  .  (2.96) 


Justification.  First,  note  that  E(x*(t)  |  x*(t)}  *  E{x*(t)  (H'^Cy)}. 
To  see  this,  note  that  (y)  *  Ht(x*)  •  N~  where  N”  X  Ht(x*'),  for 
N~  l  H®.  Now,  let  x+  be  defined  by 

fx+(t+l)  «  F(t)x+(t)  +  B* (t)w*(t) 
y(t)  «  H(t)x+(t)  +  R*(t)^w*(t)  . 


(S+) 


(2.97) 


Then,  noting  that  (2.57)  implies  n(t+l)F(t)II(t)  *  JI(t+l)F(t)  and 
H(t)  *  H(t)II(t),  we  obtain 

x*(t)  -  n(t)x+(t)  . 


(2.98) 


Hence,  (2.96)  is  equivalent  to  showing  that  x+(t)  :* 

E{x+(t)  |  H^_1(y)}  «  x*(t)  .  But  x+(t)  is  generated  by  the  Kalman- 
Bucy  filter 

x+(t+l)  «F(t)x+(t)  +  K(t)R+(t)Js[y(t)  -H(t)x+(t)]  ;  x+(0)«0,  (2.99) 

where  K  is  the  gain  function  and  R+  is  given  by 

•f  . 

R  ■  A  -  HPH*  , 


(2.100) 
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AQ(t)  ■  E{y(t)y(t)'}  and  P(t)  *  E{x+(t)x+(t) '}.  Using  the  output 
equations  of  (2.3,8a)  and  (2.91)  alternatively  to  compute  §{y(t)  |  H~  ^ (y) } 
we  easily  see  that 

Hx*  ■  Hx+  ,  (2.101) 

which  implies  that  HP*H*  «  H?H?.  Hence  R+  *  R#  and  the  process 
R+(t)"Js[y(t)  -  H(t)x+(t))  is  the  innovation  process  w*(t)  defined  by 
(2.17).  To  justify  the  conjecture,  it  then  only  remains  to  show  that 
the  gain  functions  K  and  B»  are  the  same.  Due  to  time  limitations,  we 
shall  leave  this  open.  □ 


2.7.  The  Bryson-Frazler  Formulation 

In  this  section  we  shall  derive  the  discrete-time  version  of  the 
Bryson-Frazier  smoothing  formula  [35].  This  will  be  done  by  using  a 
procedure,  based  on  an  orthogonal  decomposition  of  H(y),  which  does  not 
require  that  S  be  regular.  Then  the  smoothing  formula  of  Rauch,  Tung 
and  Striebel  [36]  will  be  obtained  as  a  corollary. 

Since  H(y)  ■  H(w*)  »  H”_1(w<l)  9  H*(w#)  (for  w*  is  a  white  noise) 
and  H"_x(w*)  -  H“_x(y) 

H(y)  “  Ht-l(y)  #  Ht(w*}  *  (2.102) 

and  consequently,  (2.80)  yields 


x(t)  -  E{x(t)  |  (y)}  +  E(x(t)  |  H+(wJ}  , 
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which  in  view  of  (2. IS)  and  the  orthogonality  between  x*(t)  and  H+(w*) 
can  be  written 

x(t)  «  x*(t)  +  E{z*(t)  |  H*(wJ}  (2.103) 

where 

z*(t)  «  x(t)  -  x*(t)  .  (2.104) 

This  stochastic  process  satisfies  the  forward  recursion 

z*(t+l)  »r*(t)z*(t)  ♦  [B(t)  -  B^tJRJt^D^Mt),  (2.105) 

where  T*  is  the  feedback  matrix  function  (2.91).  To  see  this,  first 
note  that  (2.16a)  can  be  written 

x*(t+l)  »  T*(t)x#(t)  +  B^OOMt^yOO  ; 

then  insert  (2.1b)  into  this  and  subtract  from  (2.1a)  to  obtain  v\105). 
Moreover,  we  see  that  the  covariance  function  of  z„  is  precisely  Q*  as 
defined  in  Section  2.3. 

However,  to  evaluate  the  second  term  of  (2.103)  we  shall  need 
the  backward  counterpart  of  (2.105),  in  the  sense  of  Section  2.5. 

Modulo  a  complete  description  of  the  exogeneous  noise  v,  such  a  back¬ 
ward  representation  was  provided  by  Pavon  in  [9] . 

Lenina  2.31.  ([9])  Let  x  be  the  state  process  of  any  realization  in 

S  and  let  P  be  its  covariance  function.  Then  the  process  z*  defined  by 
(2.104)  satisfies  the  backward  recursion 

z*(t)  »Q*(t)r*(t)'Q*(t+l)#z*(t+l)  +Q*(t)H(t)'R*(t)”V*(t)+vCt)  (2.106) 

where  r*  and  Q*  are  given  by  (2.91)  and  (2.18b)  respectively  and  v  is  an 
(unnormalized)  white  noise  whose  components  are  contained  in  [H(y)]X. 
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Relation  (2.106)  is  a  baokuard  representation  in  the  sense  that 
H“(w*,v)  x  H*(zJ  for  all  t  e  [0,T], 


Equation  (2.106)  was  derived  in  [9]  by  first  noting  that  it  is 
no  restriction  to  assume  that  the  basic  Hilbert  space  H  can  be  written 
H  ■  H(y)  •  H(n),  where  n  is  an  n-dimensional  white  noise  process  of 
type  (2.2)  such  that  H“_^(q)  1  H*(z*)  for  all  t  e  fO,T] .  In  fact,  such 
a  framework  is  sufficient  for  representing  the  state,  output  and  input 
processes.  Next,  it  was  seen  that  w*  and  n  could  be  regarded  as  outputs 
of  a  forward  realization  with  (2.105)  as  its  state  equation;  the  white 
noise  character  of  the  output  modifies  the  construction  of  a  backward 
representation. 


An  alternative  derivation  of  Lemma  2.31,  which  in  addition  pro¬ 
vides  a  complete  characterization  of  the  process  v,  can  be  obtained 
along  the  lines  of  Theorem  4.3  in  our  continuous -time  paper  [64]: 

First  note  that,  for  each  t  €  [0,T] ,  there  is  an  orthogonal  p  x  p-matrix 
V(t)  such  that 


B(t)“ 

1  (t)  B2(t)“ 

X 

1  £ 
k 

V(t) 

D(t)_ 

R(t)*  0 

where  Bj^  is  n  x  m  and  is  n  x  (p  -  m).  Let 


u 

v 


=  V  w 


(2.106) 


(2.107) 


define  a  pair  of  mutually  uncorrelated  white  noise  processes  u  and  v 
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of  dimensions  m  and  p  -  m  respectively.  Then  Bw  ■  B^u  ♦  B„v  and  Dw  » 
and  (2.105)  can  be  written 

**(t+l)  «T*(t)z*(t)  -I\,(t)Q*(t)H(t)'  R(t)"\»(t)  +B2(t)v(t).  (2.108) 

To  see  this,  first  use  (2.16c)  to  see  that  R**R  ■  I  -  R^HQ^H'  and 
(2.16b)  to  see  that  BjR1*  -  B*R*  ■  -FQ*H’;  from  this  it  is  easy  to  see 
that  Bj  -  B^R^R1*  ■  .  Given  (2.108),  Lemma  2.31  follows  from 

the  appropriate  modifications  of  Proposition  2.21  and  some  tedious 
calculation.  This  also  provides  an  expression  for  v  in  terms  of  v  and 
z#,  which  is  useful  in  obtaining  a  representation  for  the  smoothing 
error,  such  as  the  one  in  Theorem  4.3  in  [64]. 

Now,  we  are  ready  to  state  the  main  result  of  this  section.  To 
simplify  notations,  we  introduce  the  process  {r(t)  ;  t  e  [0,T+1]} 
defined  by 

r(t)  :»  E{z#(t)  |  H*(w,)}  .  (2.109) 

Since  w#  is  defined  only  on  [0,T],  set  r(T+l)  *  0. 

Theorem  2.32.  Let  x  be  the  state  process  of  any  S  e  S  with  state 
covariance  function  P  and  let  x  be  the  corresponding  smoothing  esti¬ 
mate  (2.80).  Then ,  for  all  t  e  [0,T], 

x(t)  *  x*(t)  +  r(t)  (2.110) 

where  r  is  defined  by  (2.109)  and  satisfies  the  backward  recursion 
f  r(t)  =  Q*(t)r*(t)'Q*(t+l)#r(t+l)+Q*(t)H(t)«R*(t)Jsw*(t) 


r(T+l)  *  0  . 


(2.111) 
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Proof.  In  view  of  the  definition  (2.109)*  (2.110)  is  the  same  as 
(2.103).  Next*  since  the  components  of  v  are  orthogonal  to  H(w*), 
(2.111)  follows  trivially  from  Lemma  2.31.  □ 

Remark.  Relations  (2.110)  and  (2.111)  imply  that  the  covariance  matrix 
£(T>1)  *  Q*(T+1),  which*  in  view  of  Lemma  2.29,  is  positive  definite 
if  S  e  S+. 

The  Bryson-Fraz ier  formula  [35]  can  now  be  obtained  from  the 
above  theorem. 

Corollary  2.33.  Let  x  be  the  state  process  of  a  realisation  S  e.  S+. 
Then  the  smoothing  estimate  x  satisfies 

x(t)  «  x*(t)  ♦  Q*(t)z(t-1)  (2.112) 

where  x*  and  Q*  are  given  by  (2.16)  and  z  satisfies 

z(t-l)  »  I\(t)'f(t)  +  H(t)»R*(t)"V(t)  ;  z(T)  *  0  .  (2.113) 

The  process  z  is  related  to  r  through 

z(t-l)  -  Q*(t)-1r(t)  .  (2.114) 

Proof.  Since  S  e  S+,  Q*(t)  >  0  for  all  t  e  [0,T+1]  (Lemma  2.29)  and 
consequently,  the  process  z  given  by  (2,114)  is  well-defined.  Then  the 
result  follows  from  the  theorem.  □ 

Finally,  to  obtain  the  result  of  Rauch  et  al  [36],  the  following 


proposition  is  needed. 
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Proposition  2.34.  Lst  Sc  S+  and  lot  x  be  its  stats  process.  Then  the 

tmootking  aatimta  x  aatisfi ss  the  difftrtnaa  aquation 

* 

x(t*l)  *  F(t)x(t)  ♦  B(t)(I  -  D(t) 'R*(t)”*D(t))B(t) 'l(t) 

♦  B(t)D(t)»R.(t)‘1[y(t)  - H(t)x#(t)  -H(t)Q,(t)F(t) 'z(t)]  (2.11S) 


x(T*l)  *x#(T+l)  , 


where 


*(t)  -  Q*(t+lj [x(t+l)  -  x#(t+l)]  . 


Proof.  Writing  (2.112)  as 


x(t+l)  ■  x*(t+l)  ♦  Q*(t+l)z(t)  , 


(2.116) 


using  the  recursions  (2.18a)  and  (2.16e)  for  x#  and  Q*  and  adding  and 
subtracting  F(t)Q*(t)z(t-l)  in  (2.116),  we  obtain 


x(t+l)  -  F(t)x(t)  ♦  B(t)B(t)«i(t)  ♦  u(t) 


(2.117) 


where 


U(t)  -  -F(t)Q*(t)*(t-l)  +  F(t)Q#(t)F(t) 'z(t) 
-  B*(t)B*(t)*z(t)  +  B,(t)w*(t) 


(2.118) 


Now  inserting  (2.113)  into  (2.118)  and  using  (2.91)  to  eliminate 
we  get 

U(t)-F(t)Q*(t)H(t)  »R#(t)^B*(t)  'z(t)-F(t)Q*(t)H(t)  *R,(t)“V(t) 
-  B#(t)B*(t)’z(t)  ♦  B*(t)w#(t)  . 


Inserting  (2.16b)  and  (2.17)  into  the  last  two  terms  of  this  expression 

and  cancelling  similar  terms,  we  see  that 
wTt)’»“B(t)D(t)  •R*(t)"1D(t)B(t)  'Z(t) 

+  B(t)D(t)»R#(t)'1Iy(t)  -H(t)x*(t)  -  H(t)Q#(t)F(t)  ’z(t)]  (2.119) 
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which  immediately  yields  (2.115).  G 

Now,  let  BD'  *  0;  this  is  a  basic  assumption  in  [36].  Then 
(2.115)  reduces  to 

x(t+l)  «  F(t)x(t)  ♦  B(t)B(t)'z(t)  ;  x(T+l)  -  x*(T+l)  , 
which,  in  view  of  (2.116),  becomes 

x(t+l)»F(t)x(t)+B(t)B(t)»Q*(t+l)*1[x(t+l)-x#(t+l))  ;  x(T+l)*x#(T+l)  . 

(2.120) 

Corollary  2.35.  Let  x  be  the  state  process  of  any  S  £  S+  and  let 
BD*  *  0.  Then,  the  smoothing  estimate  x  can  be  written 

x(t)*x(t |t)+P(t | t) F (t) ’ [F(t)P(t |t)F(t) •+B(t)U(t) ']_1 [x(t+l)-F(t)x(t |t)] 
x(T+l)  -  x*(T+l)  (2.121) 

> 

where  x(t|t)  is  the  filter  §!{x(t)  |  H~(y)}  ccnd  P(t|t)  is  its  error  co¬ 
variance  matrix,  i.e.,  P(t | t)  *  E([x(t)  -  x(t|t)][x(t)  -  x(t|t)]'}. 

Proof.  First,  solve  (2.120)  for  x(t)  in  terms  of  x(t+l),  add  and  sub¬ 
tract  F(t)-1x*(t+l)  and  rearrange  terms  to  obtain 

x(t)«F(t)'1x*(t+l)  +  [F(t)"1-F(t)'1B(t)B(t)»Q*(t+l)"13[x(t+l)-x*(t+l)]. 

This  can  be  rewritten  as 

x(t)  =  F(t)_1x*(t+l)  +F(t)‘1[Q*(t+l)  -  B(t)B(t)']F(t)'"1F(t)  •  • 

(2.122) 

•  [(Q*(t+1)  -B(t)B(t)')  +B(t)B(t)']'i[x(t+l)  -x*(t+l)]. 

It  is  well-known  and  easy  to  see  that  the  one  step  predictor  x*(t+l)  and 
the  filter  x(t|t)  are  related  by 
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x*(t+l)  »  F(t)x(t | t)  (2.124) 

and  that  the  corresponding  error  covariances  satisfy  the  relation 

Q*(t+1)  «  F(t)P(t |t)F(t) '  +  B(t)B(t) *  .  (2.125) 

Finally,  note  that  (2.124)  and  (2.125)  may  be  reformulated  as 

x(t|t)  -  F(t)-1x* (t+1) 
and 

P(t|t)  «  F(t)'>1[Q#(t+l)  -  B(t)B(t)']F(t)‘"1  , 
which,  inserted  into  (2.122),  yields  (2.121).  □ 

Relation  (2.121)  is  the  formula  of  Rauch,  Tung  and  Striebel 
presented  in  [36] . 

It  remains  to  clarify  the  connections  between  the  results  of 
Sections  2.6  and  2.7.  Note  that  the  two-filter  formula  (2.87)  can 
be  written 

x(t)  *  x*(t)  +  Q*(t)Q(t)#z(t)  , 

where  z  is  defined  by  (2.82).  Comparing  this  with  (2.112),  it  is  seen 
that  we  need  to  prove  that 

z(t-l)  »  Q(t)#z(t)  , 

analogously  with  the  continuous -time  setting  [64,65].  The  problems 
encountered  in  trying  to  show  this  are  similar  to  those  of  proving  the 
conjecture  in  Section  2.6,  and  due  to  time  limitations,  we  are  leaving 
this  question  for  a  future  paper. 


CHAPTER  3 


TOPICS  ON  THE  STOCHASTIC  REALIZATION  PROBLEM  FOR 
CONTINUOUS-TIME  NONSTATIONARY  STOCHASTIC  PROCESSES 

3.1.  Introduction 

Let  the  Hilbert  space  H  be  as  defined  in  Section  2.2.  In  this 
chapter,  the  following  notations  will  be  adopted.  For  an/  n-dimensional 
stochastic  process  z,  Ht(z)  will  denote  the  (closed)  subspace  spanned 
by  the  random  variables  (z^t:),  z2(t),  •••>  *n(t)}.  Let  H(z)  and  the 
past  spaces  H^(z)  and  tj(z)  be  defined  asVT£j  HT(z),  where  the 
interval  I  is  (-»,  *>),  (-«,t]  and  [tQ,t]  respectively.  The  future  spaces 
Hr .  ,(z)  and  H*(z)  are  defined  analogously.  Sometimes,  we  shall  be 

l.i,  t^j  t 

interested  in  spaces  spanned  by  the  increments  of  z.  Hence,  we  de¬ 
fine  H(dz),  H~(dz)  and  H*(dz)  to  be  the  closed  linear  hulls  in  H  of 
(z(s)  -  z(r);  s,  r  e  I},  where  the  interval  I  is  (-00,00),  C-°°,t]  and 
[t,00)  respectively. 

Let  (x(t) ;  t  e  R }  and  {y(t) ;  t  e  ]R }  be  two  stochastic  processes 
of  dimensions  n  and  m  respectively,  defined  as  the  solution  of  the 
linear  stochastic  system 
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(S) 


dx  *  F(t)x(t)dt  +  B(t)dw 
dy  *  H(t)x(t)dt  +  D(t)dw 


(3.1a) 


where  w  is  a  vector  process,  of  dimension  p  £  m,  with  orthogonal  in¬ 
crements  such  that 


H{dw>  -  0 


E{dwdw’}  «  I  dt 


(3.2) 


and  Ht(dw)  1  H“(x)  for  all  t  £  R.  The  matrix  R(t)  :*  D(t)D(t)'  is. 
positive  definite  on  R,  the  matrix  F(t)  is  uniformly  asymptotically 
stable  on  ]R  and  F,  B,  H,  D  and  R"1  are  matrices  of  bounded  and  analy¬ 
tic  functions.  As  before,  the  process  x  is  called  the  state  of  the 
model  S,  y  is  the  output  and  w  is  the  input.  We  shall  assume  the 
model  S  to  be  minimal  in  the  sense  that  there  is  no  other  model  of  the 
form  (3.1)  with  the  process  y  as  its  output  and  with  a  state  process  x 
of  smaller  dimension  than  n.  The  stochastic  realization  problem  con¬ 
sists  of  finding  all  possible  stochastic  systems  (3.1)  (belonging  to 
a  class  S  to  be  prescribed  below)  having  the  process  {y(t) ;  t  e  IR}  as 
their  output.  Each  such  model  S  will  be  called  a  stochastic  realiza¬ 
tion  of  y;  In  particular,  S  is  minimal  and  analytic,  i.e.,  F,  B,  H, 

D  and  R_1  are  analytic  on  ]R . 

For  each  t  e  1R,  there  exists  an  orthogonal  matrix  V(t)  such  that 


B(t! 

Bx(t)  B2(t) 

D(t) 

JHt)*5  0 

V(t) 


(3.3) 


where  B^  is  n  x  m  and  B^  is  n  x  (p  -  m) .  It  is  no  restriction  to  take 
V(t)  =  I.  Next,  let 

[dul 


dv 


=  dw 


(3.4) 
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define  a  pair  of  orthogonal  increment  processes  u  and  v,  of  dimensions 
m  and  p  -  m  respectively.  It  is  obvious  that  (3.4)  satisfies  (3.2). 
Accordingly,  we  have  reduced  the  problem  to  that  of  finding  all  admiss- 

able  models  S  of  the  type 

► 

dx  ■  F(t)x(t)dt  +  B.  (t)du  +  B.(t)dv  (3.5a) 

(S)  •  k 

dy  «  H(t)x(t)dt  +  R(t)*  du  ,  (3.5b) 

having  the  process  -Cy(t) ;  t  e  R}  as  an  output.  Clearly,  the  matrix 
function  P(t)  :*  E{x(t)x(t)’}  satisfies  the  differential  equation 

P  -  FP  +  PF»  +  B^B£  +  B2B£  (3.6) 

on  ]R.  We  shall  call  P  the  state  covariance  function  of  S. 

As  we  have  seen  in  the  previous  chapters,  it  is  sometimes  more 
convenient  to  use  a  backward  representation  for  the  state  process  x. 

To  do  this,  we  need  to  invert  the  matrix  function  P.  Since  S  is  mini¬ 
mal,  (F,B)  must  be  completely  controllable  [4,63].  Since  in  addition, 

F  and  B  are  analytic,  (F,B)  must  be  totally  controllable  [68,69], 

With  this  condition  satisfied,  it  is  not  hard  to  prove  that  P(t)  must 
be  positive  definite  for  all  t  e  R  (see  [71;  p.28]).  Then,  the  process 

x(t)  =  P(t)"1  x(t)  (3.7) 

is  well-defined  with  components  in  H.  Let  P  be  its  covariance  function 

P(t)  =  E{x(t)x(t) '}  .  (3.8) 

The  following  lemma,  the  proof  of  which  can  be  found  in  [64],  is  the 
analogue  of  Lemma  2.1. 

Lemma  3.1.  Let  x  be  the  state  process  of  an  arbitrary  realization 
(3.5)  with  state  covariance  P.  Then  the  process  x  defined  by  (3,7) 


To  obtain  a  backward  realization  for  y,  insert  (3.9)  in  (3.5b)  to 
get  dy  ■  (HP  +  R^Jx  +  R^du.  Then,  defining  and  S2  to  be  P^Bj  and 
P-1B2  respectively.  Lemma  3,1  yields 

dx  *  -F’(t)x(t)dt  +  B.  (t)du  +  B,(t)d? 

(S)  (3.11) 

dy  *  G’(t)x(t)dt  +  R^(t)du  , 

where  the  n  x  m-matrix  function  G  is  defined  by 

G  =  PH*  +  B^  .  (3.12) 

Note  that  (3.5)  and  (3.11)  have  the  same  state  space,  i.e., 

Ht(x)  =  Ht(x)  (3.13) 

for  each  t  e  TR  .  We  shall  call  any  model  of  type  (3.11)  with  y  as  its 

output,  and  H“(dw)  l  H*(x)  for  each  t  e  ]R  a  backward  realization  of 
t  z  » 

(y(t);  t  «  ]R}.  Note  that  §  is  also  analytic. 
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3.2.  The  Finite  Interval  Case:  A  Review 


In  order  to  complete  the  statement  of  the  realization  problem 
indicated  in  Section  3.1,  we  shall  have  to  specify  the  classes  S 
and  S  to  which  the  realizations  (3.5)  and  (3.11)  respectively  belong. 
To  this  end,  we  shall  first  restrict  ourselves  to  the  finite  interval 
[tQ,t^],  whore  tQ  and  t^  are  arbitrary  elements  of  IR.  Realization 
theory  for  processes  defined  on  a  finite  interval  was  developed  in  our 
papers  [64,65]  (this  is  the  continuous-time  version  of  the  theory  of 
Chapter  2)  and  next  we  shall  briefly  review  some  facts  from  it. 


Hence  we  shall  consider  models 


(S) 


dx  *  F(t)x(t)dt  +  B1(t)du  +  B2(t)dv 
dy  *  H(t)x(t)dt  +  RU^du 


x(t0) 


K 

(3.14) 


of  type  (3.5)  but  defined  on  the  finite  interval  [t  ,t^].  We  shall 
call  such  representations  stochastic  realizations  of  {y(t);  t  e  [tQ,tj]}. 
Note  that  it  is  now  necessary  to  specify  the  initial  condition  £;  dif¬ 
ferent  £  will  define  different  realizations  S.  If  N  :=  E{£5'}  is 
positive  definite,  x(tQ)  can  be  thought  of  as  being  generated  by  a 
model  (3.5)  on  the  interval  (-®,tQ];  consequently,  (3.14)  is  merely  a 
restriction  of  (3.5)  to  the  interval  [tQ,tj].  However,  note  that  the 
class  (3.14)  also  contains  realizations  S  such  that  N  is  singular 
(even  zero);  then  there  is  no  such  interpretation,  and  we  shall  have 
to  take  some  care  in  defining  the  corresponding  backward  model. 


The  linear  least -squares  estimate 

x*(t;tQ)  *  E{x(t)  |  Hjt  >t](y)> 


(3.15) 
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of  the  state  process  x  of  S  is  generated  on  [tQ,t1]  by  the  Kalman-Bucy 
filter 


dx*(t;t  )  -  F(t)x*(t,t  )dt  +  B*(t,tQ)du*(t,t0)  ;  x*(t0;tQ)  -  0  (3.16a) 


where 


du„(t,tl  -  R(t)_)5[dy  -  HCt)*.(t;t0)4t]  .  (3.16b) 


"o'  *  '  -  -  -  o 

The  matrix  function  B*,  called  the  Kalman  gain  is  given  by 

B*(t,to)  -  Q*(t,tQ)H(t)  ’RCt)”15  ♦  Bx(t)  , 
the  error  covariance  matrix 


(3.16c) 


Q*(t,to)  E([x(t)  -  x*(t;to)][x(t)  -  x*(t;tQ)]'}  (3.16d) 

being  the  solution  of  the  matrix  Riccati  equation 
^Kt,to)  -  F(t)Q*(t,t0)  +  Q*(t,t0)F(t)»  -  B*(t,t0)B*(t,t0)'  ♦  B(t)B(t)  ' 

lq.(t0,t0)  -  P(t0)  , 

with  B*  given  by  (3.16c). 

•Let  P*(t,t0)  E{x*(t;to)x*(t;to)0.  Then,  it  is  easy  to  see  that 

P*(t,t  )  -  P(t)  -  Q*(t,tQ)  (3-17) 


for  all  t  e  [t^tj]  and  that 


P*Ct0,t0)«o 


(3.18) 


The  representation  (3.14)  is  not  the  only  model  defined  on  [tQ,t1] 
that  has  (3.16a)  as  its  Kalman-Bucy  filter.  Hence,  define  S[tQ,t1]  to 
be  the  class  of  all  analytic  realizations  S  of  (y(t);  t  e 
which  has  (3.16a)  as  its  Kalman-Bucy  filter  on  [t^].  Since  we  are 
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only  considering  proper  [2]  realizations,  not  only  F,  H,  R  and  B*,  but 
also  the  estimates  x#(t;t  )  will  be  the  same.  Of  course,  different 
S  c  S[tQ,tjJ  will  have  different  state  processes  x  and  different  Q*,  B, 
w  and  P.  Rewriting  (3.16b)  to  obtain  an  expression  for  dy,  it  is  easy 
to  see  that  (3.16a)  and  (3.16b)  together  define  a  realization  in 
S[t0,tj];  we  shall  call  it  S*(t  .tj). 

Now  for  any  realization  S  e  S[tQ,t1],  the  n  *  m-matrix  function  G 
defined  by  (3.12)  is  invariant  for  the  class  S.  To  see  this,  note  that 
(3.16c)  and  (3.17)  yield  that 

G(t)  -  P*(t,to)H(t)»  ♦  B*(t,t0)R(t)**  ,  (3.19) 

and  that 

B*(t,t0)  «  (G(t)  -  P*(t,t0)H(t)')R(t)'J5  .  (3.20) 

Therefore,  (3.18)  may  be  written 

^  Gt,t0)  -  A(t,P*(t,t0))  ;  P*(t0,tQ)  »  0,  (3.21) 

where  A  is  defined  by 

A(t,P)  =  F(t)P  +  PF(t) 1  +  (G(t)  -  PH(t) ’)R(t) -1 (G(t)  -  PH(t)')'. 

(3.22) 

Let  S+[tQ,t^]  denote  the  subclass  of  all  realizations  in 
S[tQ,ti]  such  that  N  >  0.  For  such  an  S  the  construction  of  backward 
realizations  presented  in  Section  3.1  remains  valid,  for  X  will  be  well- 
defined  on  the  whole  interval  [tQ,t^].  Then,  by  symmetry  with  the. 
forward  setting,  we  can  now  see  that,  given  an  arbitrary  backward  reali¬ 
zation  S  €  S  rt  ,t.],  the  estimate 

x*(t;tx)  -  i{x(t)  |  H^t  j(y)> 


(3.23) 


no 


of  the  process  X  is  generated  on  by  the  backward  Kalman-Buay 

filter 

dx*(t;tjO  ■  -F(t)rx*(t;tj)dt  +  5#(t,t1)da#(t,t1)  ;  X*(t1;t1)»0,  (3.24a) 
where 

du*(t,t1)  «  R(t)_J*[dy  -  G(t)'xw(t;t1)dt]  .  (3.24b) 

The  matrix  function  8*  is  given  by 

Sjt.tp  «  -Q*(t,t1)G(t)R(t)"Is  +  Sx(t)  ,  (3.24c) 

where 

^(t.t^  -F(t)'Qt(t,t1)  •0*(t,tl)F(t)  +8*(t,t1)8*(t,t1)'  -8(t)8(t)' 

* 

&CW  «  P(t:)  -  P^1  .  (3. 24d) 

Here,  of  course,  Q*  is  the  error  covariance  matrix 

QfrCt.tp  :■  E{[X(t)  -  ^(tjtj)]  [x(t)  -  x*(t;t1)]'}  .  (3.24e) 

Let  P*(t,t1)  :*  Etx^tjtjJx^tjtj) '}.  Then 

P*(t,tx)  -  PCt)  -  Q*(t,tx)  (3.25) 

for  all  t  e  [tQ,t1]  and 

^Kt,tx)  =  A(t,P*(t,t1) )  ;  PJt^)  *  0  ,  (3.26) 

where  A  is  given  by 

A(t,P)  *  F(t) 'P  +  PF(t)  +  (H(t) '  -  PG(t))R(t)_1(H(t) '  -  PG(t)) ' .  (3.27) 

It  can  be  shown  that  the  matrix  function  B*  is  invariant  over  the 
class  S[tQ,t1)  (of.  Lemma  2.5).  Hence  the  backward  filter  (3.24a)  is 
invariant  also.  Now,  define  ^[t^t^]  to  be  the  class  of  all  analytic 
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backward  realizations  S  of  (y(t);  t  €  which  has  (3.24a)  as  its 

Kalman- Bucy  filter. 

Analogously  to  the  forward  setting,  it  is  seen  that  (3.24a)  and 

(3.24b)  properly  reformulated,  constitute  a  realization  in  S[tQ,t.];  it 

will  here  be  called  3*(t  ,t.). 

0  1 

We  wish  to  construct  the  forward  counterpart  S*(to,tj)  of 
3*(t0,t.).  However,  we  now  run  into  problems,  because  P*(tj)  «  0  so 
that  the  (reverse  of  the)  construction  in  Section  3.1  is  no  longer 
valid.  But,  as  shown  in  [64,65],  the  minimality  and  analyticity  of 
3*  implies  that  P*  :»  P*1  is  well-defined  on  [t^tj-e]  for  all  e  >  0, 
and  hence  so  is  x*  :*  P**x*.  Consequently,  we  can  define  the  model 

dx^Ctjt.)  * F(t)x*(t;t.)dt  +  B*(t,t,)R(t)  ^[dy  -  H(t)x*(t;t. )dt] 
(S*(t  ,t))'  1  A  1  L 

°‘  1  |**CVV  *  MVV’^-'W  (3.28a) 

on  any  such  interval;  such  a  realization  is  called  a  genei‘dlized  reali¬ 
zation  of  (y(t);  t  e  [t^tj]}.  (See  [64,65].)  Here 

B*(t,tx)  -  -(^(t.t^HW'R^)^  -  B^t))  ,  (3.28b) 

with  Q*  satisfying 

^jrCt,^)  ■F(t)Q*(t,t1)  ♦Q*(t,t1)F(t)'  +B*(t,tl)B*(t,t1)'  -B(t)B(t)> 
.QM^.tj)  *  P*Ct0,t1)"1  -  P(t)  .  (3.28c) 

Let  P^ttjtj)  :=>  E{x*(t;t1)x*(t;t1) *} .  Then,  it  can  be  seen  that 

P*(t,tx)  *  P(t)  +  Q*(t,tx)  (3.29) 

and  that 
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-  ACt,P*(t,t.A))  ;  P*^,^)  ■P*(t0,t1)"1.  (3.30) 

As  we  have  seen  in  Chapter  2,  since  Q#(t,tQ)  z  0  and 
Q*(t,tA)  z  0,  it  can  be  easily  seen  that  for  all  t  e  [tQ,tA], 

P*(t.tQ)  *  P(t)  s  P^Ct.t^  .  (3.31) 

Hence  the  models  S*  and  S*  are  called  the  minimum-variance  and  the 
maximum-variance  respectively  (cf.  [64,65]). 


Finally,  we  recall  that  these  two  models  S#  and  S*,  contain  all 
the  information  on  y  that  is  needed  to  estimate  x.  Consequently,  as 
was  done  in  Chapter  2  and  in  [64,65],  it  is  seen  that  the  smoothing 
estimate 


xtet^tj)  *  E(x(t)  |  H[t  j(y)}  (3.32) 

of  the  state  process  x  of  any  realization  S  e  S+[tQ,tA]  is  given  by 
xCtJtjjtj)  *  [I  -  0*(t,to)Q(t,to,t1)~1]x^(t;to)  + 


on  [to,tA),  where 


+  Q*(t,t0)Q(t,t0,t1)“1x*(t;t1) 


Q(t,tQ,t1)  *  P*(t,tA)  -  P*(t,tQ) 


(3.33) 


(3.34) 


3.3.  Stochastic  Realizations  on  IR 

In  this  section,  we  shall  let  tQ  ->■  -»  and  t^  ».  Consequently, 
we  shall  extend  the  discussion  of  the  previous  section  to  the  infinite 
interval  setting  of  Section  3.1. 

Since,  for  each  fixed  t  e  R,  the  process  {x*(t;-T);t£  -t}  is  a 
uniformly  integrable  wide  sense  martingale  [70],  x*(t;tQ)  tends  to  a 
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limit  x*(t)  :•  E(x(t)  |  VtoStHjt  >tj(y)}  in  me4n  as  tQ  -►  -» 

(cf.  [2;  p.378]).  But  Vt0StH[t0,t] Cy)  *  Ht(y)  and  conse<lu«ntly 

x*(t)  -  E{x(t)  |  H;Cy)}  (3.35) 

for  all  t  €  R.  Then  u*(t,tj  tends  to  a  limit  process 

o 

(u*(t);  t  e  R}  which  satisfies  (3.2),  for  u*(t,tQ)  satisfies  (3.2). 
Since  x*(t;tQ)  •+■  x#(t),  P*(t,tQ)  and  B*(t,tQ)  as  given  by  (3.17)  and 
(3.20)  tend  to  the  limits  P*(t)  and  B*(t)  respectively.  Consequently, 
x*  and  u*  must  satisfy 

dx*(t)  -  F(t)x*(t)dt  ♦  B*(t)R(t)"J5[dy  -  H(t)x*(t)  dt]  (3.36) 


for  each  t  e  R.  Now,  we  define  S  to  be  the  class  of  all  analytic 
realizations  (3.5)  of  (y(t);  t  e  R}  whose  Kalman-Bucy  filter  on  any 
interval  [tQ,t1]  tends  to  (3.36)  (in  the  obvious  sense)  as  tQ  +  -09 
and  as  tj  •+■  ».  It  is  easy  to  see  that  (3.36)  may  be  reformulated  to 
yield  the  model 


CSJ 


Fx*dt  +  B*du* 

ic 

Hx*dt  +  R  du*  . 


(3.37) 


Let  E{x+(t)  |  Hr.  ^i(y)>  be  denoted  X*(t;t  ).  Then,  by  a  similar 
argument  to  that  leading  to  (3.35),  the  limit  in  mean  square  of 

A  _ 

$*(t;t  )  as  t  +  ■«  is  seen  to  be  E(x*(t)  |  Ht(y)}#  which  is  x*(t). 
Since,  in  addition  S*  is  minimal  and  analytic,  this  implies  the  model 
S*  belongs  to  S. 


The  following  proposition  summarizes  the  pertinent  facts  about 
the  model  S*.  The  results  are  obtained  from  the  corresponding  finite 
interval  results  by  merely  taking  the  appropriate  limits  as 
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explained  above.  It  is  the  nonstationary  version  of  Theorem  4.1 
in  [2]. 

Proposition  3.2.  There  is  one  and  only  one  realization  (3.5)  in  S, 
namely  (3.37)  having  aty  of  the  following  properties: 

(i)  x#(t),  u*(t),  B*(t)  and  the  state  covariance  P*(t)  of  S*  are 
the  limits  (the  first  two  in  the  mean  square )  of  x#(t;tQ) , 
u#(t.,t0)>  B*(t,tQ)  and  P*(t,tQ)  respectively  as  tQ  +  -*>, 

(ii)  the  covariance  matrix  function  P*  satisfies 

P*(t)  «  A(t,P*(t))  ,  (3.38) 

where  A  is  given  by  (3.22),  and  it  is  minimum  in  the  sense  that 

P*(t)  *  P(t)  (for  each  t  £  *)  ,  (3.39) 

(Hi)  the  innovation  process  u#  satisfies 

H^du*)  -  H~(y)  (3.40) 

for  all  t  (  1,  and 

(iv)  for  any  realization  S  e  S,  with  state  process  x,  the  process 
x*  is  the  estimate 

H{x(t)  |  H'(y)}  =  x*(t)  (3.41) 

(i.e.t  x*  is  invariant  with  respect  to  the  particular  realiza¬ 
tion  S  e  S.). 

In  analogy  with  the  forward  setting,  we  see  that  the  process 
x*(t;t^)  tends  to  a  limit  x*(t)  in  mean  square  as  t^  -*■  ».  Consequently, 
u*(t,tp,  B*(t,tp  and  P*(t,t^)  tends  to  the  limits  u*(t),  B*(t)  and 
P*(t)  as  t^  •+  «.  Hence,  we  obtain  a  representation  analogous  to  (3.36) 


US 


for  we  shall  call  it  the  steady-state  backward  Kalmcn-Buay  filter. 
Therefore,  we  define  S  to  be  the  class  of  all  analytic  backward  reali¬ 
zations  (3.11)  of  (y(t);  t  £  1}  whose  Kalman-Bucy  estimate  on  the 
interval  tends  to  the  steady-state  backward  Kalman-Bucy  filter 

as  tj  •+  «.  Precisely  as  in  the  forward  setting,  we  can  see  that  x*  is 
the  state  process  of  the  model 


(3*) 


dx*  *  -F*x*dt  +  B*du* 
u 

dy  ■  G'x*dt  +  R^du* 


(3.42) 


and  that  3*  c  S.  The  state  covariance  function  P*  of  3*  satisfies 

P*(t)  *  A(t,P*(t))  (3.43) 

where  A  is  given  by  (3.27). 

Since  P*(t)  >  0  for  all  t  e  R,  we  see  that  the  process 


x*(t)  -  P*(t)_1X*(t) 
is  well-defined,  and  is  the  state  of  the  model 


(3.44) 


(S*) 


dx*  *  Fx*dt  +  B*du* 


dy  *  Hx*dt  +  R^du*  , 


(3.45) 


where  B*  and  u*  are  the  limits  of  B*(t,t^)  and  u*(t,tp  as  -*■  ».  The 

state  covariance  function  P*  of  (3.45)  is  P~*  =  ( lim  P*(t,t1))"^  * 

-l  tl’+co 

lim  (P*(t,t,))  *  lim  P*(t,t.).  It  is  easy  to  see  that  P*  satisfies 

*  Jt  ^  _  i 

ti-»« 


V 


P*(t)  -  A(t,P*(t)) 


(3.46) 


and  that  Q*(t)  :=  E{[x(t)  -  x*(t)][x(t)  -  x*(t)]'}  is  given  by 


Q*(t)  *  P*(t)  -  P(t)  . 


(3.47) 
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Since  x*(t)  «  lim  x^tjtj)  and  Etx^tjt^  |  Hft  tl(y)>  *  x*(t;tQ), 

^*400  U  0^  ^ 

we  see  that  the  Kalman-Bucy  filter  of  S*  on  [tQ,t^]  tends  to  x*  in 
mean  square  as  t  Hence  S*  €  S. 

As  a  corollary  to  Proposition  3.2  and  the  above  discussion, 
it  is  now  clear  that 

P*(t)  *  P(t)  s  P*(t)  (3.48) 

for  all  t  £  1. 

As  another  corollary,  we  shall  obtain  the  following  algorithms 
to  calculate  P*  and  P*.  They  were  originally  obtained  by  Clerget  [71] 
through  an  argument  first  used  in  [11] .  The  proof  in  [71]  relies  on 
control  theory  techniques  and  gives  little  insight  into  the  stochastic 
nature  of  the  problem.  Our  proof  is  not  only  simpler,  but  it  also  pro¬ 
vides  a  stochastic  interpretation  for  these  equations. 


Corollary  3.3.  Let  II  and  II  be  the  unique  solutions  of  the  n  x  n- 
matrix  partial  differential  equations 


3' 


3I(t,s) 


A(t,n(t,s)) 


JI(t,0)  =  0 


(3.49) 


and 


4-+  5s)n(t,s)  =  A(tJ(t,s))  ;  frct,0)  *  o  (3. so) 

respectively ,  where  A  and  A  are  given  by  (3.22)  and  (3.27)  respectively . 
Then  II(t,s)  +  P*(t)  and  fl(t , s)  -*■  P*(t)_1  a«  s  +«. 


Proof:  Let  t  e  3R.  Set  s  =  t  -  t  and  II(t,s)  :=  P*(t,t  -  s) .  Then 
P*(t,tQ)  =  H(t,t  -  tQ).  Hence  ^  (t,tQ)  *  [~  +  ^]ll(t,t  -  tQ). 


! 
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But,  by  (3.22),  ^  (t,tQ)  =  A(t,P*(t,tQ))  *  A(t,II(t,t  -  tQ)).  Since 
P*(t,t)  -  0,  II(t,0)  »  0.  Hence  (3.49)  follows.  To  see  that 
H(t,s)  P*(t)  as  s  +  »,  just  observe  that,  by  Proposition  3.2, 
P*(t,tQ)  P*(t)  as  tQ  -*■  -».  Finally,  (3.50)  follows  from  (3.43)  and 
an  argument  analogous  to  the  above.  □ 


Finally,  we  shall  solve  the  smoothing  problem  for  this  infinite 
interval  setting. 

Let  S  e  S  be  arbitrary.  The  smoothing  problem  requires  deter¬ 
mining  the  estimate 

x(t)  :*  E{x(t)  |  H(y)}  (3.51) 


of  the  state  x  of  S.  Recall  that  the  smoothing  estimate  x(t;tQ,t1)  on 
the  finite  interval  [t^t^  is  defined  by  (3.32).  Since,  by  the  argu¬ 
ment  used  above 


x(t;tQ,t1)  *  E{x(t)  |  H[t  >ti]Cy)}  +  E{x(t)  |  H(y)}  *  x(t) 

as  t  ■+•  -“  and  as  t,  •+■  00 ,  we  see  that  (3,33)  yields  the  following 
o  l 

formula  ((3.53)),  which  holds  under  the  assumption  that  the  covariance 
function  of  y  is  ooerowe,  i.e.,  there  exists  a  positive  constant  C 
such  that 


't  2 

.  u* (r)K  (r,s)u(s)drds  >  C  ||u(t)|| 

y 


(3.52) 


for  all  square  integrable  functions  u.  It  was  shown  by  Clerget 
[71;  p. 29]  that  this  assumption  implies  P*  -  P*  >  0. 


W|<IIPWI|II1|I|||I|1 
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Proposition  3.4.  Assume  that  the  aooariance  function  K  of  y  is  co¬ 
ercive.  Let  x  be  the  state  prooess  of  an  arbitrary  realisation  S  e  S, 
and  let  x  be  the  corresponding  smoothing  estimate  (3.51).  Then 

x(t)  -  [I  -  Q*(t)Q(t)"1]x*(t)  +  Q*(t)Q(t)_1x*(t)  ,  (3. S3) 

where  Q*  »  P  -  P*  and  Q  «  P*  -  P*. 

Remark.  Formula  (3.53)  suggests  a  representation  for  the  state  process 
of  an  internal  realization  (i.e.,  a  realization  satisfying  H(x)  c  H(dy).) 
If  S  e  S  is  internal,  then,  by  (3.51),  x(t)  *  x(t)  for  each  t  e  IR. 

Corollary  3.5.  Assume  that  the  oovariance  function  K of  y  is  coercive . 
Let  x  be  the  state  process  of  an  internal  realization  S  e  S.  Then  x  can 
be  written 

x(t)  *  n(t)x*(t)  +  [I  -  n(t)]x*(t)  ,  (3.54) 

where  II  is  a  projection  given  by  U  *  Q*Q-1  Q*  *  P*  -  P. 

Proof.  In  view  of  Proposition  3.4,  it  only  remains  to  show  that 
H  =  Q*Q_1  is  a  projection.  Let  2(t)  :*  E{[x(t)  -  x(t)][x(t)  -  x(t)]'}. 
Then,  it  can  be  shown  (Chapter  2)  that  Z  =  Q*  -  Q*Q-1Q*.  Then,  since 
x  *  x,  E  »  0.  Consequently,  Q*Q-1  *  Q*Q"*Q*Q~*.  But  I  -  Q#Q-1  * 

Q*Q-1,  hence  Q*Q_1  =  Q*Q_1Q*Q"1,  i.e.,  n2  =  n.  □ 
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3.4,  The  Set  P 

Let  P  be  the  set  of  all  state  covariance  functions  P  given  by 
(3.6)  as  S  ranges  over  S.  Any  P  £  P  must  also  satisfy  (3.12);  (3.6) 
and  (3.12)  constitute  the  equations  of  the  nonstationary  version  of 
the  Positive  Real  Lemma.  Since,  in  addition,  the  state  covariance  P 
of  any  S  £  S  is  positive  definite,  we  may  write  P  ■  {P  ■  P*  >0  |  P 
solves  (3.6)  and  (3.12)}.  It  can  easily  be  checked  that  P  is  bounded 
and  convex  [71],  Some  straightforward  algebraic  manipulations  yield 

P  *  (P  «  P*  |  A(t,P(t))  -  P(t)  £  0  for  all  t  £  »}.  (3.55) 

For  each  P  e  P,  define  the  feedback  matrix 

r  ■  F  -  (6  -  PH,)R"'1H  .  (3.56) 

Let  the  feedback  matrices  corresponding  to  P*  and  P*  be  denoted  T*  and 
T*  respectively.  Let  P+  *  {P  |  P  >  P*}  and  P_  *  (P  |  P  <  P*}.  If  the 
covariance  function  of  y  is  coercive,  i.e.,  satisfies  (3.52),  P*  -  P*  >  0. 
Consequently  P+  and  P_  are  both  nonempty. 

Now  let  P  be  the  subset  of  P  defined  by 
o 

?Q  *  {P  £  P  |  A(t,P(t))  -  P(t)  =  0}  .  (3.57) 

Then  it  is  immediately  seen  that  PQ  *  (P  £  P  j  B2  *  0}. 

Lemma  3.6.  Let  S  £  S  with  state  covariance  function  P.  Then  S  is  in¬ 
ternal  if  and  only  if  P  £  P  . 


Proof.  Let  P  €  P  .  Then  B9  *  0,  hence  S  has  the  form 

v  4* 
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dx  ■  Fxdt  +  B^du 
dy  *  Hxdt  +  R^du  . 

k 

Consequently,  x  can  be  solved  for  in  terms  of  y  as  follows 

dx  *  Fxdt  +  BjR^dy  -  Hxdt] 

which  clearly  implies  that  S  is  internal.  Since  the  smoothing  for¬ 
mula  (3.53)  is  the  same  as  (3.33),  the  argument  in  proving  Theorem  4.4 
in  [64]  can  be  used  to  prove  the  converse.  □ 

Corollary  3.7.  P*  and  P*  belong  to  P  . 

Proof.  This  follows  from  (3.38)  and  (3.46).  □ 

It  is  worth  noting  that  once  the  covariance  function  P  of  a 
realization  S  e  S  is  known,  the  quadruplet  [F,  B,  H,  (R  ,0)]  is  deter¬ 
mined  upon  observing  that 

Bx  =  (G  -  PH‘)R->S  (3.58a) 

BjB^  =  P  -  A(P)  .  (3.58b) 

h 

Determining  the  quadruplets  [F,  B,  H,  (R  ,0) ]  solves  what  we  have 
called  before  the  wide  sense  stochastic  realization  problem,  i.e.,  is 
equivalent  to  finding  all  realizations  whose  outputs  have  the  same  co- 
variance  properties.  In  the  next  section,  we  shall  present  an  algorithm 
to  generate  such  quadruplets. 


Finally,  it  is  not  hard  to  see  that  Proposition  1.12  and  Corol¬ 
laries  1.13  and  1.14  hold  for  this  setting  also  with  obvious  modifications. 
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3.5.  Non-Riccati  Algorithm  Inside  8 

In  this  section,  we  shall  present  an  algorithm  to  generate  (wide 
sense)  realizations  without  resort  to  the  intermediate  step  of  solving 
for  P.  This  algorithm  is  the  nonstationary,  continuous-time  version  of 
(1.54)  and  Theorem  6.2  in  [2].  In  order  for  the  idea  of  proof  used  in 
[2]  to  be  applicable  in  our  nonstationary  setting,  we  shall  have  to  in¬ 
troduce  an  additional  condition  on  the  given  process  (y(t);  t  e  1R): 
Assume  that  y  is  generated  as  the  output  of  a  stochastic  system  (3.5) 
such  that  B2  is  constant  and  nonzero.  Of  course  this  does  not  imply 
that  all  S  e  S  have  such  a  B2,  merely  that  there  are  nontrivial 
(B2  *  0),  elements  in  the  subclass  Sc  :«  (S  £  S  |  B2  constant).  Let  P£ 
be  the  subset  of  P  corresponding  to  realizations  in  Sc> 

Lenina  3.8.  Let  P  e  P  .  Then  A(t,P(t))  -  P(t)  ie  constant  on  the 

C 

real  line. 

To  develop  the  algorithm,  we  shall  first  construct,  for  a  given 

matrix  function  P  e  P  ,  a  trajectory  of  matrix  functions  in  P  extend- 
o  c 

ing  from  P*  through  PQ  to  P*,  so  that  these  functions  are  totally  ordered 
in  a  sense  to  be  defined  below. 

Theorem  3.9.  Assume  that  the  covariance  function  K of  y  is  coercive. 

Let  A  be  defined  by  (3.22).  Let  P  be  an  arbitrary  function  in  P  . 

Let  P  be  the  unique  solution  in  the  (t,s)  plane  of  the  matrico  partial 
differential  equation 
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33]  P(t,s)  -  A(t,P(t,s))  ;  P(t,0)  -  PQ(t)  .  (3.59) 

Then*  (i)  P(»,3)  e  ?  for  all  s  e.  (-",»),  (ii)  for  each  t  e  R, 

P(t,s2)  s  PftjSj)  for  Sj  3  *2*  OH)  if  p0  €  P_*  for  9ach  t  e  R, 

P(t,s)  P*(t)  as  s  +  *  and  (iv)  if  PQ  e  P+J  for  each  t  £  R, 

P(t,s)  -*■  P*(t)  as  s  > 

Remark.  Before  proving  this  theorem,  it  is  worth  noting  that  the  par¬ 
tial  differential  equation  (3.59)  can  be  trivially  transformed  to  an 
(infinite)  family  of  ordinary  differential  equations  (3.18)  with  differ 
ent  initial  conditions.  To  see  this,  set  a  *  t  -  s  and  let 
P(t,a)  :«  P(t,t  -  a).  Then,  it  is  easy  to  see  that  the  left-hand  side 

«s* 

A  p 

of  (3.59)  is  j|~(t,a).  Finally,  P(a,0)  *  P(a,o).  Hence,  (3.59)  is  the 
ordinary  differential  equation 

g|(t,o)  -  A(t,P(t,o))  ;  P(a,o)  -  P Q(o)  .  (3.60) 

The  following  two  lemmas  will  be  needed  for  the  proof  of 
Theorem  3.9. 

Leraa  3.10.  The  matrix  partial  differential  equation  (3.59)  hae  a 
unique  eolation  P(t,s)  which  is  a  matrix  of  analytic  functions  in  the 
two  real  variables  t  and  s. 

Proof.  It  is  well  known  that  the  ordinary  differential  equation  (3.60) 
has  a  unique  solution  [4;  p.156]  which  is  also  analytic  (for  the  param¬ 
eter  matrices  are).  Then  the  same  is  true  for  (3.59).  □ 
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Lemma  3.11.  Let  PQ  e  P  .  Then  the  matrix  partial  differential  equa¬ 
tion  (3.S9)  can  be  replaced  (in  the  eenee  that  it  has  the  acme  solution  P) 
by  the  system 

-  U(t>S)[ACt,P0(t))  -  P0(t)]U(tfs)'  ;  P(t,0)  -  PQ(t)  (3.61a) 
(sF+ 5?)U(t's)  *  r(t,s)U(t,s)  ;  U(t,0)  -  I  (3.61b) 

where  T(t,s)  is  the  feedback  matrix  (3.56)  corresponding  to  P(t,s). 

Proof.  We  shall  use  the  differentiation  technique  of  [21].  The  reason 
why  this  method  works  in  this  nonstationary  setting  is  of  course  that 
the  coefficient  matrices  do  not  depend  on  s,  which  is  the  dynamic  vari¬ 
able  of  the  algorithm.  First,  reformulate  relation  (3.59)  to  read 

3F  +  ?s)P(t»s)  3  [F(t)  -  6(t)R(t)”1H(t)]P(t,s)  +  P(t,s)  [F(t)  -GCtJRW’Vt)]' 

+  P(t,s)H(t)»R(t)’1H(t)P(t,s)  +G(t)R(tr1G(t)'  .  (3.62) 

Since  P(t,s)  is  a  matrix  of  analytic  functions,  the  mixed  partial 
derivatives  of  P(t,s)  with  respect  to  t  and  s  are  identical.  Using 
this  fact,  differentiating  (3.62)  with  respect  to  s  and  setting  N(t,s)  :* 
|~(t,s),  it  can  be  seen  that 

<5F  +  ^ (t ' s)  3  r(t,s)N(t,s)  +  N(t,s)r(t,s) '  .  (3. 63a) 

In  view  of  (3.59),  this  partial  differential  equation  has  the  boundary 
condition 

N(t,0)  -  A(t,Po(t))  -  P0(t)  ,  (3.63b) 

which,  by  Lemma  3.8,  is  constant  for  PQ  e  P£.  Consequently,  (3.63)  can 
be  integrated  to  yield 


N(t,s)  -  UCt^JHCt.OJUCt.s)' 


> 


(3.64) 


where  U  is  given  by  (3.61b).  Consequently,  P  satisfies  (3.61).  But 
(3.61)  clearly  has  a  unique  solution,  and  therefore  the  lemma  follows.  □ 

Proof  of  Theorem  3.9.  Let  P  e  P  .  Then  A(t,P  (t))  -  P  (t)  s  o,  and 

0  0  0  0 

consequently,  by  (3.61a), 

ap 

ffct.s)  S  0  ,  (3.65) 

which,  in  view  of  (3.59)  and  (3.55),  implies  that  P(»,s)  £  P  for  all 

s  £  (-*,*),  i.e.,  (i)  holds.  Property  (ii)  is  an  immediate  consequence 

of  (3.65).  To  prove  (iii),  we  follow  [72],  First  note  that  (i),  (ii), 

and  (3.39)  imply  that,  for  each  t  e  JR,  s*+-  P(t,s)  is  a  nonincreasing 

function  bounded  from  below  by  P* (t) ,  and  consequently  P(t,s)  tends  to 

a  limit  P(t)  as  s  +  «.  It  remains  to  show  that  P  *  P„.  Keeping  t 

fixed  and  letting  s  •+■  *  in  (3.S9),  it  is  not  hard  to  see  that  P  satis- 
*  dp 

fies  the  differential  equation  »  A(t,P(t));  hence  P  e  P  .  Then,  by 
Lemma  3.6,  P  is  the  state  covariance  function  of  an  internal  realization; 
let  x  denote  its  state  process.  But,  then  x  satisfies  (3.54)  for  some 
family  (H(t);  t  £  3R}  of  projections,  i.e.,  x*  -  x  =*  II(x*  -  x*),  which 
yields 

P*(t)  -  P(t)  *  JI(t)[P*(t)  -  P*(t)]n(t)'  .  (3.66) 

To  see  this,  use  the  orthogonality  relations  of  Lemma  3.5  in  [64].  Now, 
use  the  fact  that  PQ  e  P_  and  P(t)  £  PQ(t)  f°r  a11  t  e  JR  (for  P(t,») 
is  a  nonincreasing  function),  to  see  that  P(t)  <  P*(t)  for  all  t  e  ]R. 
Consequently,  for  each  t  e  ]R, 
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it(t)[p*(tO  -  p*(t)]nct)*  >  o 

which  can  hold  only  if  H(t)  is  nonsingular.  But  lift)  is  a  projection, 

2 

i.e.,  IlCt)  »  II(t),'  which  together  with  nonsingularity  implies  that 
n(t)  =  I.  Hence,  it  follows  from  (3.66)  that  P  *  P#,  as  required.  The 
proof  of  (iv)  is  analogous.  □ 

We  are  now  ready  to  formulate  the  non-Riccati  algorithm  to 
generate  families  of  (wide  sense)  realizations.  As  all  realizations 
are  determined  by  the  matrix  B,  the  algorithm  will  be  given  in  terms  of 
this  parameter  only. 

Let  8  ■  {B  «  (B^,B2)  I  B^^  and  B2  are  given  by  (3.58)  as  P  ranges 
over  P}.  Let  8q,  8_  and  8+  be  defined  analogously  in  terms  of  PQ,  P_ 
and  P+  respectively.  It  is  clear  that  8q  consists  of  those  B  e  8  for 
which  B2  »  0.  In  particular,  let  B*  and  B*  denote  the  elements  of  8q 
corresponding  to  P*  and  P*. 

Theorem  3.12.  Assume  that  the  coo ariance  function  of  y  is  coercive. 

Let  [F,  B°,  H,  (R^O)]  be  an  arbitrary  (wide  sense)  realization  of  y 

such  that  B°  is  constant  on  (-®,00)  and  let  s  B(t,s)  =  [B^ftjS),  B2(t,s)] 

be  the  unique  solution  of  the  system 
3B 

*35<t,s)  =B2(t,s)B2(t,s)'H(t)»R(t)"Js  J  Bl(t,0)  *  B°(t)  (3.67a) 

(4+^)B2Ct>S)  *  [F(t)  -BiCt,s)R(t)’,*H(t)]B2(t,s)  ;  B2(t,0)-B®.  (3.67b) 

For  each  s  e  (-00,00).,  let  P(t,s)  be  the  unique  solution  of 

|£(t,s)  *F(t)P(t,s)  +P(t,s)F(t)'  +B(t,s)B(t,s)»  . 


(3.68) 
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Thent  for  each  s  e  (-*,*),  [F,  B(*,s),  H,  (R^.O)]  is  a  (wide  sense) 
realisation  of  y,  with  state  oooariance  function  P(«,s).  If  BQ  «  8_, 
for  sack  t  e  1R  t  B(t,s)  B*(t)  «  s  +  and  if  B°  e  for  each 
t  €  IR.,  B(t,s)  -►  B*(t)  as  s  -*•  Finc&ly t  the  function  P  satisfies 
aonditions  (i)  -  (iv)  of  Theorem  3.9  and  the  equation 

-  -B2(t,*)B2Ct,s)'  .  (3.69) 

Proof.  Let  P  be  the  state  covariance  function  of  the  initial  real- 
o 

realization  [F,  B°,  H,  (R**,0)]  and  let  s  **■  P(t,s)  be  the  trajectory 
through  PQ  defined  by  Theorem  3.9.  Define 

B^(t, s)  -  [G(t)  -  P(t,s)H(t)  ,]R(t)“Js  (3.70a) 

and 

B2(t,s)  «  U(t,s)B°  ,  (3.70b) 

where  U(t,s)  is  given  by  (3.61b).  Then,  |~<t,s)  «  U(t,s)N(t,0)U(t,s) '  * 
-U(t,s)B°B°'  U(t,s)  •  «  -B2(t,s)B2(t,s) ’.  This  proves  (3.69).  Differ¬ 
entiate  (3.70a)  with  respect  to  s  and  use  (3.69)  to  get  (3.67a).  To 
prove  (3.67b),  differentiate  (3.70b)  with  respect  to  s  and  use  (3.61b). 

In  view  of  (3. 59)  and  (3.69),  ^|{t,s)  *  -B2(t,s)B2(t,s) '  = 

A(t,P(t,s))  -  ||(t,s).  Hence  |^<t,s)  *  A(t,P(t,s))  +  B2(t,s)B2(t,s) ' 
which  is  (3.68).  Hence  (P(t,s),  B(t,s))  satisfy  (3.6)  and  (3.12),  and 
consequently  [F(t),  B(t,s),  H(t),  (R(t)  ,0]  is  a  realization  of  y  with 
state  covariance  P(t,s)  which  satisfies  conditions  (i)-(iv)  of  Theorem  3.9. 
Finally,  the  fact  that  B^(t,s)  B*(t)  (B*(t))  under  the  stated  condi¬ 

tions  follows  from  condition  (iii)  ((iv))  of  Theorem  3.9.  Since 

9P 

££<t,s)  -*•  0  as  s  +  +®(-®),  B2(t,s)  +  0  as  s  +  +°°(-») .  Hence  B(t,s) 

(B*(t) ,0)  ((B*(t) ,0))  as  s  +  -h»(^»).  D 


Remark.  This  theorem  has  the  following  interpretation.  Each  realiza¬ 
tion  (3.5)  in  S  gives  rise  to  a  family  of  realizations  indexed  by 

V 

S  €  (-*,«“) 

dxs  ■  F(t)xg(t)dt  ♦  B^(t,s)du  +  B2(t,s)dvs  (3.71a) 
dy  -  H(t)x  (t)dt  ♦  R(t)W  (3.71b) 

S  5 

which  are  totally  ordered  in  the  sense  that  the  state  covariances 
P(t,s)  »  E{xs(t)xs(t)  •}  of  Sg  satisfy  P(t,s2)  s  P^s^  for  s^  s  s2« 

If  B°  e  8  ,  this  family  will  contain  the  minimum-variance  reali- 
zation  S*,  and  if  B°  e  8^ ,  the  family  contains  the  maximum-variance 
realization  S*.  Finally,  if  B°  e  8q,  (3.71)  will  contain  only  one 
realization:  (3.5)  itself. 

3.6.  The  Singular  and  The  Mixed  Cases 

The  stochastic  realization  problem  may  be  classified  into  three 
categories: 

(i)  the  regular  case,  for  which  R(t)  is  positive  definite  for 
all  t  «  1R  , 

(ii)  the  singular  case,  for  which  R(t)  =  0,  and 

(iii)  the  mired,  case,  for  which  R(t)  *  0,  but  det  R(t)  »  0  for 
all  t  e  JR. 

(In  fact,  there  is  a  fourth  case,  which  we  shall  not  deal  with  here, 
i.e.,  the  case  for  which  det  R(t)  *  0  for  some,  hit  not  all  t  e  1R.) 

By  the  assumptions  made  in  Section  3.1,  we  have  just  studied  the 
first  of  these  cases.  The  analysis  used  depends  heavily  on  the  fact 
that  R(t)  is  invertible  for  all  t  e  IR.  To  solve  the  problems  of  the 
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last  two  categories,  we  shall  first  have  to  reformulate  them  in  the 
following  way:  the  given  realization  [F,  G,  H,  R]  will  be  converted 
to  an  equivalent  realization  (in  the  sense  of  the  definition  given 
below)  [F  ,  G  ,  H  ,  R  ]  where  Ra(t)  is  positive  definite  for  all 
t  e  R.  The  results  of  this  section  are  generalizations  of  similar 
results  presented  In  [18]. 

Definition  3.13.  ([18]).  The  two  realizations  [F^,  G^,  H^,  R^] 
and  [F2,  G2,  Hj,  R2]  are  said  to  be  ecfiivodent  if  they  have  the 
same  set  P. 

Let  us  start  with  the  singular  case,  i.e.,  let  R(t)  «  0  for 
all  t  e  K.  Then  the  equations  (3.6)  and  (3.12)  of  the  Positive 
Real  Lemma  become 


FP  +  PF'  ♦  BB’ 

(3.72a) 

G  -  PH’  «  0 

(3.72b) 

P  *  P’  >  0  . 

(3.72c) 

Proposition  3.14.  Let  the  entries  of  G  and  H  be  differentiable  at 
least  n  times.  Then ,  the  realizations  [F,  G,  H,  0]  and  [F,  G  ,  H ,  R  ], 

ft  ft  ft 

where 

G  *  G  -  FG  ,  (3.73a) 

ft 

H  »  HF  +  H  ,  (3.73b) 

ft 


and 
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Ra  »  HG  *•  G'H»  -  HFG  -  G'F'H'  (3.73c) 

are  equivalent  (in  the  sense  of  Definition  3. IS). 

Proof.  All  realizations  arising  from  the  quadruplet  [F,  G,  H,  0]  have 
the  form 

dx  -  Fxdt  +  Bdw  (3.74a) 

y  «  Hx  (3.74b) 

with  state  covariance  function  P,  which,  together  with  F,  G,  and  H  sat¬ 
isfy  equations  (3.72).  Let  y  *  Then,  using  relation  (3.74b),  the 

& 

following  is  a  realization  of  the  process  y 

cl 

dx  *  Fxdt  +  Bdw  {3.75a) 

dya  *  H  xdt  +  HBdw  .  (3.75b) 

where  H  is  given  by  (3.73b).  Observe  that  (3.74)  and  (3.75)  have  the 

same  state  process  x  and  hence  the  same  state  covariance  P.  Hence, 

these  realizations  of  y  and  y  have  the  same  set  P.  We  shall  show  that 

a 

all  realizations  obtained  using  the  quadruplet  [F,  G  ,  H  ,  R  J  are  of 

4  4  4 

the  form  (3.75).  First,  observe  that  G  as  given  by  (3.73a)  can  be 

cl 

written  as  G  *  PH*  +  BB'H’.  To  see  this,  we  use  the  following  sequence 
a 

of  equalities:  G  *  G  -  FG  =  PH*  +  PH*  -  FPH'  =  P[F'H*  +  H']  + 
a 

PH*  -  FPH'  -  PF'H'  =  PH^  +  BB'H',  which  is  the  G  that  corresponds 

to  (3.75),  obtained  from  the  Positive  Real  Lemma.  It  remains  to  show 

that  R  *  D  D'  where  D  =  HB.  This  is  shown  by  the  following  sequence 
a  a  a  a 


of  identities: 
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R  «  HG  -  G'H*  -  HFG  -  G»F‘H» 
a 


»  HPH»  -  HFPH*  -  HPF'H' 
»  H[P  -  FP  -  PF']H' 


»  «  D  D*  .  □ 

a  a 


Consequently,  by  the  above  proposition,  instead  of  studying  the 
quadruplet  [F,  G,  H,  0],  we  can  study  [F,  G  ,  H  ,  R  ].  If  R  *  0, 

a  a  a  a 

the  above  procedure  of  differentiating  the  output  may  be  repeated  until 

a  positive  definite  R  is  obtained.  If  R  *  0,  but  det  R  SO,  the  pro- 

sl  a  a 

cedure  of  the  third  category,  which  will  be  discussed  shortly,  may  be 
used. 

With  this  setup,  all  the  results  of  the  previous  sections  can  be 

carried  over  using  G  ,  H  and  R  instead  of  G,  H  and  0. 

a  a  a 

Next,  we  consider  case  (iii)  which  is  the  most  general.  Here  we 
need  more  assumptions  than  in  case  (ii).  Assume  R(t)  has  constant  rank 
for  all  t  e  ]R  and  the  entries  of  R(*)  are  differentiable  at  least  n 
times.  Set  k  :=  rank  R(t). 

Since  R(t)  *  0  but  det  R(t)  =»  0,  by  Dolezal’s  Theorem  (see 
e.g.  [73]),  there  exists  a  nonsingular  matrix  function  S,  the  entries 

A 

of  which  are  differentiable  at  least  n  times,  such  that  R  =  SRS  * 

nil  q 

q  0  with  Rj  a  positive  definite  k  x  k  matrix  function.  This  trans¬ 
formation  corresponds  to  a  change  of  basis  in  the  subspace  spanned  by  y: 
instead  of  examining  the  process  y,  we  examine  another  process  y  =  Sy. 
Without  loss  of  generality,  we  shall  assume  R  ■l?a  with  as  above. 
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D  in  the  following  manner: 
x  n  and  H2  is  (m  -  k)  x  n, 
x  k  and  is  n  x  (m  -  k)  and 
and  D2  is  (m  -  k)  x  p. 

Consequently,  equations  (3.6)  and  (3.12)  become 


*  FP  +  PF*  +  BB» 

(3.76a) 

Gx  -  PHj  -  BDj 

(3.76b) 

D1D1  *  Ri  >  0 

(3.7.6c) 

d2  -  0 

(3.76d) 

G,  -  PHi  ■  0 

(3.76e) 

P  «  P'  >0 

(3. 76f) 

The  following  is  the  generalized  version  of  Proposition  3.14. 


Partition  the  matrices  H,  G  and 

ds; 


H 


H„ 


with  Hj^  is  k 


G  ■  [G^,  G^]  with  Gj  is  n 

ns:, 

with  Dj  is  k  x  p 


Proposition  3.15.  Let  the  entries  of  G,  H  and  R  be  differentiable  at 
least  n  times  end  let  R(t)  be  of  constant  rank  for  all  t  e  IR.  Let 
Gj,  G2,  H2  and  R^  be  as  defined  above.  Then  the  realizations 
[Fj  G,  H,  R]  and  [F,  GaJ  RJ,  where 


and 


Ga  «  [GiG2  -  FG2]  . 
H, 


H  = 
a 


H2F  +  H2 


\  G1H2  '  H1G2 
H2G1  -  G2H;  H2G2  -  G2A2  -  H2FG2  "  G2F’H2 


(3.77a) 

(3.77b) 


(3.77c) 
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are  equivalent  (in  the  sense  of  Definition  3.13). 


Proof.  The  proof  is  an  immediate  consequence  of  that  of  Proposition  3.14 

[yi] 

],  where  y  has  been  partitioned  com- 

Vy7 


and  follows  upon  defining  y 


patibly  with  the  rest  of  the  matrices,  as  y 


□ 


If  R  obtained  above  is  not  full  rank,  we  repeat  the  above  proce- 

d 

dure  of  changing  the  basis  and  differentiating  the  component  of  the 
output  that  does  not  contain  a  white  noise  until  we  arrive  at  a  nonsingu¬ 
lar  Ra<  The  natural  question  that  arises  is  whether  this  procedure 
terminates  in  a  finite  number  of  steps.  Silverman  [74]  has  shown  that, 
subject  to  some  extra  regularity  conditions,  the  answer  is  yes. 

As  a  special  case,  we  may  quite  easily  obtain  all  the  results 
of  Germain  [18]  for  the  stationary  singular  and  mixed  cases.  These  re¬ 
sults  are  summarized  in  the  following  two  corollaries. 

Corollary  3.16.  Let  F,  G  and  H  be  oonstant  and  let  R  *  0.  Then  the 
quadruplets  [F,,  G,  H,  0]  and  [F,  -FG,  HF.,  -HFG  -  G'F’H’]  ca>e  equivalent. 


Corollary  3.17.  Let  F,,  G,  H  and  R  be  oonstant  and  let  R  *  0  but 

det  R  =  0.  Then  the  quadruplets  [F,  G,  H,  R]  and  [F,  G  ,  H  ,  R  ],  where 

cL  cl  cl 


Ga  =  [Gl  -  FG2]  ,  Ha  = 
[R, 


R  =■■ 
a 


H1 

h2f 


and 


GjH'  - 


H2G1  -  G2H1  -H2FG2  "  G2F’H2 


are  equivalent. 
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